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INHmrrORS of staphylococcus SarA PROTEIN FUNCTION INVOLVED IN 
THE EXPRESSION OF STAPHYLOCOCCAL VIRULENCE FACTORS AND THE 
USE THEREOF IN TREATING STAPHYLOCOCCAL INFECTIONS 

RELATED APPUCATIONS 

This application is a continuation-in-part of U.S. Serial No. 60/142J93 filed on 
July 8, 1999, and which is incorporated. in its entirety by reference. 

BACKGROUND OF THE INVENTION 

5 Field of the Invention 

The present invention relates to inhibitors of Staphylococcus SarA protein function 
involved in the e3q>ression of staphylococcal virulence factors and the use of these inhibitors 
to treat and prevent staphylococcal infections m subjects. Particularly, the inhibitors act to 
interfere with the binding of the SarA protein to its bmding site(s). The selection of specific 
10 inhibitors of the SarA protein is made possible as a result of the identification of the bmding 
sites of SarA protein on at least a portion of the agr (accessory gene regultor) gene, a gene 
that like the sar (st2q[)hyIococcal accessory regulator) gene, plays a role in the virulence of 
Staphylococcus. 

There is a great and urgent need among infectious-disease specialists, who have 
IS begun seeing one of their worst nightmares come true. They may be losing their last line of 
defense against the dangerous pathogen Staphylococcus aureus (5. aureus), which causes 
infections ranging firom skin abscesses to such life-threatening conditions as pneumonia, 
endocarditis, septicemia, and toxic shock syndrome. Roughly one-third of the strains 
currentiy isolated from patients who acquire 5. aureus mfections while hospitalized are 
20 resistant to all antibiotics but one, vancomycin and now resistance to that antibiotic is 
cropping up. The present invration provides a new approach to combating S. aureus that 
may sidestep the organism's ability to develop resistance. 

Despite intensive research efforts over the past SO years, &aphylococcus^ 
particularly ^aphylococcus aureus^ rraiains a serious threat to human health. In fact, recent 
2S reports describe clinical isolates with reduced susceptibility to vancomycin. Therefore, 5. 



wo 01/03686 



PCTAJS00/18S2S 



aureus represents a bigger threat to human health now, than at any time since the pre- 
antibiotic era. 

Staphylococcus is an opportunisitic bacteria that takes advantage of 
immunoconq)romised subjects and may become pathogenic in these subjects. There are 
S approximately thirty-two species of Staphylococcus with only three consistently causing 
human disease. S. aureus is clearly the most prominent disease causing species, followed 
by 5. epidermidis, and in a distant third is 5. saprophyticus. S. epidermidis is l)ecoming 
more prominent as a disease causing species because it causes infections of in-dwelling 
medical devices. As a result, researchers are looking more carefully at 5. epidermidis, and 

10 as a result of this research, have found homologs of both the sar and agr genes in 5. 
q}idermidis. Huckiger, U., etal. (1998). OttoM ^etal. (1998), respectively. 

S. aureus can cause a diverse anay of diseases ranging from relatively siq^erfidal 
mfections of the skin (boils) to infixtions of die eye (endoptfialmitis) to life Areatening 
osteomyelitis, endocarditis and toxic shock syndrome (reviewed by Frojan and Novick, 

IS 1997). S. aureus is armed with a large battery of virulence factors that raable it to colonize 
a human host and cause a variety of disease states (reviewed by Projan and Novick, 1997). 
Nosocomial infections are of particular concern for two reasons. The first is that the 
majority of life-threatening infections arise in the hospital environment. For exanq)le, while 
the frequency of S. aureus infections incurred during orthopedic or cardiac implant surgery 

20 is steady, the overaU nunlber of infections has risen dramatically in the past decade. This is 
largely due to the increase in the frequency of these procedures. S. aureus has an amazing 
capacity to colonize in-dwelling prosthetic devices. The second reason for increased 
concern of 5. aureus infections is that strains of methicillin-resistant ^aphylococcus mreus 
(MRSA) are endemic in hospitals. Moreover, strams with some resistance to vancomycin 

2S emerged in the United States in 1997 (Tenover et al., 1998; Smith et a/., 1999; Sieradzki et 
at., 1999). Therefore, the need tor new, eflfective treatments for this drug resistant 
pathogen is urgent. 

The variety of virulence factors expressed by 5. aureus contribute to a highly 
efficient system for siurvival. Early in the infection, surface proteins are predominantly 
30 expressed. Protein A and the adhesins {e.g., collagen, fibronectin) are rq)resentative 
surfiEu:e protems that solve two problons for the 5. aureus cell. Furst, they bind to 
extracellular matrix components and anchor the cell to ttie host tissue. Second, Hxty provide 
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a host protein camouflage which helps the infecting cell elude the host's immune system. 
The nascent colony increases in size until a oitical number of cells is achieved (quorum) 
and a switch is thrown to re-organize the expression of virulence factors from surface 
proteins to exoproteins. These latter factors contribute to sequestration of the colony within 
S a protective biofllm and enzymatic degradation of host tissue with an army of digestive 
enzymes, such as nucleases, lipases and proteases, which eventually result in an abscess. 
These enzymes accomplish two important functions or the bacterium: (1) allowing space for 
growth of the colony by getting rid of host tissue and (2) digested host tissue is assimilated 
by the bacterial cells for growth. Deep-seated abscesses, such as those found in 

10 staphylococcal ostemyelitis and endocarditis, often require surgical intervention to 

remediate the disease. It is inq[)ortant to note that this phenotypic switching process can be 
largely recapitulated in the laboratory ravironment, witti surfEice protein expression 
occurring in die early log phase of a culture^s growth and exoprotein expression occurring 
late in log and into the stationary phases of growth. 

IS The potency of this pathogen can be attributed to the coordinated, tenq)orally- 

regulated expression of a wide array of virulence factors. Early in infection expression of 
surface proteins predominates, e,g.^ the collagen and fibronectin adhesins and protein A. 
The surface proteins allow the organism to attach to host tissues and evade the inunune 
system. However, when the concentration of S. aureus cells at the site of infection 

20 becomes high, surface protein expression is reduced and exoprotein expression increases. 
The temporal regulation of surface protems and exoproteins can be recapitulated in 
' laboratory culture growth models, where early log phase growth represents an early 
infection and stationary phase rqpresents late infection. Using this model system and 
classical genetics, two major pleiotropically-acting regulatory loci that govern traiporal 

25 ei^ession of surface proteins and exoproteins have been identified: agr^ for accessory gene 
regulator (Recsei ei al., 1986; Morfeldt ei al., 1988; Peng et al., 1988) and sar, for 
staphylococcal accessory gene regulator (Cheung et al., 1992; Cheung and Projan, 1994). 
Mutations in these loci result in aberrant regulation of most virulence factors (e.g., lipase, 
coagulase, a-toxin, adhesins, etc), whidi is reflected in diminished virulence in animal 

30 models of staphylococcal disease (Ptojan and Novick, 1997). 

A sdieme dq»icting the agr locus and its encoded proteins is shown in Figure 1. 
Divergent promoters (P2 and P3), sqparated by approximately 180 bp, are responsible for 
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transcription of the agrBDCA operon and RNAJSUhld operon (Morfeldt et al., 1996). The 
four Agr proteins combine to make a quorum-sensing system that is homologous to many 
two-conqponent signal transduction systems found in prokaiyotic organisms (Ji et al., 
1997). AgrB is a cell membrane-bound transporter/processor of the AgrD peptide. AgrD is 
5 a 46 amino acid peptide that is cleaved to an octapeptide pheromone, exported by AgrB, 
and specifically recognized by (Ji et al., 1997) the AgrC membrane-bound receptor. The 
AgrD octapeptide pheromone allows an 5. aureus cell to signal its presence to other cells in 
the growing colony. As the colony grows, the concentration of pheromone (Agr D) 
increases and reaches a particular level. AgrC» also an integral membrane protein, is 

10 activated by pheromone binding. AgrC is tiiought to be a kinase that acts on AgrA by 
initiating a signal transduction patiiway that is believed to include AgrA. Whereas die exact 
mechanism of AgrA action is unknown, it is inqx>rtant for up-regulation of virulence gene 
expression and is diougbt to activate expression of the agr operon (RNAII) and tfie 
divergendy expressed RNAm. It is clear, however, firom the work of Arvidson's group 

IS (Morfeldt et al.^ 1996) that AgrA does not bind DNA either in the presence or absence of 
SarA. Mutations in any of the agr open reading frames (ORFs A, B, C, D) eliminate the 
up-regulation of RNAII and RNAIII expression (Novick et al., 1995). Additionally, agrA 
mutants have dramatically reduced virulence in animal models of staphylococcal arthritis, 
osteomyelitis, endocarditis and endophthalmitis (Abdelnour et al,, 1993; Cheung et al., 

20 1994a; Gillaspy et al., 1995; Booth et al,, 1995). RNAIII is a regulatory RNA species, the 
function of which is not completely clear. However, there is evidence that RNAm directiy 
regulates es^ression of some 5. aureus virulence genes by an anti-attrauation mechanism 
(Novick et aL, 1993; Saravia-Otten et al., 1997). 

In summary, when 5. aureus attadies to a host tissue, a small amount of the AgrD 

25 pheromone is released. Early on, the concentration of the pheromone is too low to affect 
AgrC kinase activity* However, once the number of cells has risen, the local concentration 
of pheromone increases to a level whereby AgrC becomes activated. At this point the 
phenotypic switch is thrown and exoprotein expression dramatically increases. 

Mutations in agr leads to decreased expression of exoprotein virulence factors and 

30 significantiy reduced virulence in animal models of staphylococcal arthritis, endocarditis, 
osteomyelitis and endophthahnitis (Abdelnour et al., 1993; Cheung et al., 1994a; Gillaspy 
et al., 1995; Booth et al., 1995, respectively). Inhibition of the agr quorum- 
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sensing/virulence gene activating system is a goal of die present invention. Since agr is 
activated by a transcriptional regulator, SarA, the present invention is directed to inhibiting 
this protein. 

The second regulatory locus, sar^ encodes a 14.4 kDa protein: SarA, also depicted 
5 in Figure 1. Mutations in sar^ like those in agr, lead to dramatically decreased virulence in 
animal models of staphylococcal disease (Cheung et al., 1994a). Interestingly, agr\ saf 
double mutants are less virulent than either of the single mutants in staphylococcal 
endocarditis, endophthalmitis and osteomyelitis (Ch^ng et al,^ 1994b; Booth et al.y 1997; 
GiUaspy, et al,, unpublished). Presumably, this phenotype is because SarA regulates 

10 expression of both transcripts in the agr locus {agr and RNAIII; Cheung et al.^ 1997) and 
SarA also regulates virulence fiactor genes that fell outside of agr control. For example, the 
aia gene, encoding the collagen adhesin, is not affected by mutations in the agr locus, but 
is under sar control (Gillaspy et al., 1998; Blevins et al., 1999), agr mutants do not have 
altered sar mRNA accumulation, whereas agr mRNA expression is dramatically affected by 

IS sar mutations (Cheung et al.^ 1997; Gillaspy and Smeltzer, uq)ublished). Specifically, 
there is a significant diminution of agr mRNA and nearly a conq>lete loss of RNAin in the 
saf strain ALC136 when conq)ared to the wild type stram RN6390 (Cheung et al., 1997). 
The same observation has been made in clinical isolates in which the sar gene has been 
mutated (Gillaspy and Smeltzer, unpublished). 

20 SarA present in crude extracts of 5. aureus (Morfeldt et al., 1996) or recombinant 

SarA in E. coli extracts (Heinridis et al.y 1996) or purified, recombinant SarA (Chein and 
Cheung, 1998; U.S. Patrat No. 5,387,288; Rechtin et al, 1999,) have been shown to bind 
the DNA region of the agr promoters (this region also is referred to herein as the agr 
enhancer). AgrA has not been shown to bind DNA and was not present in the SarA-a^r 

25 enhancer complexes (Morfeldt et a/.,1996). 

Arvidson and his colleagues recentiy reported that SarA is a DNA-bmding 
regulatory protein and that its binding sites were located to the P2 and P3 promoters in 
die agr locus, a region that is referred to as the agr enhancer (Morfeldt et al., 1996). The 
agr enhancer has mverted repeats of a 7 bp sequence 5'-CTTAAGT-3' (Figure 2). 

30 (Qualitative electrophoretic mobility shift assays (EMSA) were used to examine this region 
for regulatory proteins that may bind. Crude extracts of 5. aureus wild type, sar and agr 
mutants were prepared. DNA fragments containing the left half of the r^on, right half of 
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the region and the entire region were used in the EMSA tests. The migration of all of these 
fragmrats was retarded in native gels containing wild type and agr' extracts, but not sat 
extracts. Furthermore, DNA affinity chromatography was used to purify the regulatory 
protein species and one protein with a MW of approximately IS kDa was recovered. The 
5 amino acid sequence of the first 20 residues of that protein was determined and it matched 
that of the protein encoded by the sar gene. A simple model for SarA-mediated activation 
of genes in the agr locus would have Sar A protein binding to a site that includes the hq)tad 
repeats and facilitating the binding of RNA polymerase to the adjacent promoters. 
However, complicating any sunplified model is recent data from Rechtin ei al, (1999) that 
10 unambiguously shows using high-afBnity DNase I foo^rint analysis that the heptads are not 
die primary bmding sites for SarA. Ratfan SarA protected dnree distmct, bipartite sites from 
DNase I digestion at extremely low protein concentrations, indicting very high affinity 
binding. 

Production of the three distinct transcrq)ts arising from die sar operon are regulated 

15 tenq)orally (Bayer et aL, 1996; Blevins et al,, 1999). However, all three transcripts 
mclude the SarA ORF. Like agrA mutations, transposon insertions in the SarA ORF also 
eliminate induction of RNAII and RNAIU in late phase growth and result in reduced 
staphylococcal virulence in animal models of disease (Cheung et al., 1994 a and b; Booth et 
al.^ 1997). In seminal, biochemical work in this area, SarA was shown to be a DNA- 

20 binding protein that is cq>able of binding DNA fragments containing ds regulatory 
elem^its for the promote of both the agr operon (RNAII, P2 promoter) and the RNAm 
operon (P3 promoter) (Morfeldt et a/., 1996). Heptad repeats were idmtified iq)stream of 
both P2 and P3 promote and were proposed to be SarA binding sites (see Figure 2 and 
Figure 3). A DNA fragment containing the RNAm gene and 93 bp upstream of the 

25 transcription start site, including the heptad rq>eats, was sufficient for regulated expression 
of RNAm (see pEXOSS in Figure 3). Furthermore, removal of the distal half of the 
sequences upstream of the P3 promoter, including one heptad, eliminated appropriate 
expression of RNAm (see pEX082 in Figure 3). In addition, a synthetic DNA fragment 
including the repeats was bound by SarA in electrophoretic mobility shift assays (EMSA) in 

30 vitro using 5. aureus extracts and was used successfully to purify SarA from extracts by 
DNA-affinity diromatography (Morfeldt et al. , 1996). 
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In a more recent rq)ort, SarA, expressed as a GST-fusion protein in E. coU and 
purified, was observed to have relatively low affinity for DNA fragments containing the 
heptad repeats (Chien and Cheung, 1998). Furthermore, DNase I footprinting revealed a 
primary binding site for SarA in the inter-promoter region (see Figure 3) that did not 
5 include the hq)tad repeats in the fragment d5 to the P3 promoter shown to be sufficient for 
impropriate expression of RNAIU by Morfeldt et al. (1996). These two published reports of 
SarA interactions with the agr region have inconsistent conclusions regarding the binding 
sites for SarA in the agr locus (Morfeldt et al.^ 1996; Chien and Chrang, 1998). In the 
earlier rqx>rt, approximately 60 bp upstream of the P3 promoter was shown to be sufficient 

10 for appropriate, regulated expression of RNAm expression in S. aureus (Morfeldt et al,^ 
1996). In addition, Mor^dt et al. (1996) proposed that SarA likely interacts with die 
regulatory regions containing the 7 rq>eals immediately iq)Stream of the P3 and P2 
promoters (see Figure 2). However, in die latter report a DNase I footprint in die inter- 
promoter region using a recombinant fusion of GST-Sar A was observed that had no overlap 

IS with the regulatory region described in the early repon (Chien and Cheung, 1998). 

The results of Chien and Cheung (1998) reporting an in vitro study of Ss^A-agr 
interactions may not be indicative of true Sar A-agr interactions because of the nature of the 
protein used for the study. A GST-Sar A fusion was used for most of the work and it 
appears that this construct most likely yielded an inactive SarA protein for two reasons: 1) 

20 After calculations of die concentrations of protein used in the EMSA and DNase I 
experiments reported in Chien and Cheung (1998), it became clear that micromolar 
amounts of SarA were required to adiieve mobility shifts or footprints, respectively. This 
is a very high concentration of a DNA-binding protein. In the present invention, picomolar 
concentrations resulted in similar shifts in EMSA or footprints in DNase I protection 

25 experin^ts. Thus, the SarA protein used in the present invration Bppe&rs to be several 
orders of magnitude more active than the protein used by C^en and Cheung. 2) Most 
importantly, to examine the monomer - dimer equilibrium for SarA, fluorescence 
anisotropy was used (Fernando and Royer, 1992; Maleki et al., 1997). For this work a 
fluorescent dye is coupled to the protein, typically the amino terminus. Huoroscein, 

30 fluorescein with a six-carbon spacer and dansyl, indq)endendy, were coupled to the amino 
terminus of SarA, and in each case, a rapid inactivation of SarA was found as a result of 
unfolding (Hurlburt, unpublished). These dyes have been used in anisotropy eiqperiments 
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with several proteins previously without any problems (Maleki et al.^ 1997; Hurlburt 
unpublished). Thus* it was concluded that the amino terminus of SarA is intimately 
involved in SarA*s structural integrity and resultant activity. The determined low- 
resolution structure of SarA in the present invention shows that the amino terminus is 
5 proximal to the most likely DNA-binding domain of the protein. The SarA-GST fusion 
used by the Cheung group was to the amino terminus and it is strongly suspected that the 
low activity of the protein used in those studies was a result of the fusion. 

In recent work* the quorum sensing mechanism encoded by the agr operon was 
inhibited and mice became protected from infection by 5. aureus (Balaban et. aL, 1998). 

10 This group approadied die problem by targeting the quorum sensing signal molecule and its 
recq>tor. They looked at inhibitors of agr activation; i.e., the octapq>tide pheromone 
and/or the 36 kDa protein. However^ Ridiaid Novick*s group rq)0]ted that Ais kind of 
approach has a serious lunitatioii (Ji ei al.^ 1997). Namdy, strains of S. aureus can be 
grouped based on whether they activate or inhibit the quorum sensing system encoded by 

IS agr. In other words* all strains autologously activate their own agr expression, but RN6390 
inhibited agr expression in strains SA502, RN7843, RN8462 and RN8463, and vice versa 
in a process they termed bacterial interference. This likely is due to co-evolution of the 
signal molecule and its receptor. Apparently, the signal molecule from one strain inhibits 
the receptor of another, non-conq)atible strain. Thus, the Balaban approach may have 

20 limited efficacy against some strains of 5. aureus. 

The present invention approaches the treatment of st^hylococcal virulence and 
infection differently than previous publications by the mhibitmg the activation of agr gene 
expression by inhibitmg SarA function resulting in the inhibition of the expression of 
staphylococcal virulence fectors. The present method of treatment provides a way to 

25 attenuate st2q)hylococcal virulence which is believed to be more widely qyplicable than the 
Balaban mhibitor. This is so because SarA and its target DNA sequences cis to agr do not 
suffer as much strain variability as the Balaban inhibitor. Moreover, since the agr locus 
also includes RNAHI, a known regulator of virulrace gene translation, inhibiting agr gene 
expression will have a more profound effect than inhibition of the quorum sensing system 

30 alone as disclosed by Balaban. 

The present invention provides a novel method of treating stq>hylococcal diseases 
by interfering with the production of virulence foctors, which in turn* prevents the 
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Siaphylococcus species from becoming a potent pathogen. The present invention is directed 
to designing, synthesizing and identiiying potent inhibitors of SarA function and using these 
inhibitors to treat staphylococcal infections. 

SUMMARY OF THE INVENTION 

S The present invention is directed to inhibitors of staphylococcal SarA protein 

function involved in the expression of staphylococcal virulence factors and the use of these 
inhibitors to treat and prevent staphylococcal mfections in subjects. Particularly, the 
inhibitors act to interfere with the binding of the SarA protein to its binding site(s). The 
selection of specific inhibitors of the SarA protein is made possible as a result of the 

10 identification of the binding sites of SarA protein on at least a portion of the agr (accessory 
gene regulator) gene, a gene that like the sar (staphylococcal accessory regulator) gene, 
plays a role in the virulence of staphylococci species. 

The present invention further is directed to a method of identifymg inhibitors of 
SarA function involved in the expression of staphylococcal virulence genes conprising a) 

IS contacting a candidate inhibitor with at least one SarA binding site of the agr locus in 
solution to allow the binding reaction to equilibrate for a sufficient period of time; and b) 
assessing the binding of said candidate inhibitor to the SarA binding site of the agr locus. 

The present invention further is directed to a method of identifying inhibitors of 
SarA function involved in the expression of staphylococcal virulence genes comprising a) 

20 contacting a candidate inhibitor with SarA in solution to allow the candidate inhibitor to 
affect the ability of SarA to bind to at least one SarA bmding site of the agr locus; b) 
contacting said solution of stq> a) with at least one SarA binding site of the agr locus eidier 
simultaneously with the contact of said inhibitor and die SarA or subsequently to the 
contacting of the inhibitor and the SarA; and c) assessing the inhibition of the candidate 

25 inhibitor on the SarA binding to the SarA binding site of the agr locus. 

BRIEF DESCRIPnON OF THE DRAWINGS 

Figure 1 depicts a scheme of SarA and Agr roles in virulence gene expression. 
Four Agr proteins comprise a quorum SCTsmg system. RNAm regulates virulence gene 
expression at die translational level and SarA is responsible for activation of agr and 
30 RNAm expression. 
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Figure 2 depicts a sequence of the agr enhancer regions between the P2 and P3 
promoter. The heptad rq)eats reported by Morfeldt et al. (1996) are shown in the boxes. 

Figure 3 depicts the sequence of the P2-P3 promoter region, foo^rints from DNase 
I studies and DNA fiagments used in Che present invention. The diagram shows the DNA 
5 sequence of the P2 and P3 promoters and the intervening region. The protected areas from 
the DNAse I foo^rinting experiments are Indicated by the boxes and are labeled Al, A2, 
Bl, B2, CI and C2. The regions corresponding to nucleotide sequences used for 
quantitative EMSA are indicated by the lines above the sequence and are labeled A1/A2, 
B1/B2, P2, P3 and AC. The heptad rcpesHs reported by Morfeldt et al. (1996) are shown 
10 in boldfece type. The DNase I fi>otprint reported by Chien and Cheung (1998) is 
underlined. The iq>stream ends of the RNAin/P3 promoter constructs, pEX082 and 
pEXOSS, used by Morfeldt et al. (1996) for complementation studies are indicated below 
the sequence. 

Figure 4 dq)icts the purification of SarA from E. coli Coomassie Blue stained gel. 
15 Pre vs. post induction, soluble supernatant vs. insoluble, soluble vs. insoluble in 70% 
ammoniimi sul&cte, pooled fractions after hq)arin-Sepharose chromatography. 
Figure 5 depicts the chemical crosslinking of SarA. 

Figure 6 depicts DNase I protection analysis of Sar A interactions with the P2-P3 
promoter region. A, Representative DNase I result. ^P-labeled DNA fragment was 

20 equilibrated with various concentrations of Sar A prior to treatment with DNase I. Samples 
were resolved by denaturing gel electrophoresis and detected by phosphorimaging. A/G, 
G, chemical sequencing standards, solid bars on left side {e.g., 1P3, 2P3, ^.) indicate the 
position of the heptad repeats, solid bars on right side e.^., Al, A2, etc.) indicate protected 
regions, and the arrows indicate hypersensitive sites. B. Alignment and consensus 

25 sequence of protected regions. 

Figure 7 depicts the quantitative analysis of Sar A binding to regions within the P2- 
P3 promoter by EMSA. (A) Representative EMSA of SarA and various DNA fragments 
used in this study. ^P-labeled DNA fragments were equilibrated with purified SarA, 
resolved by native PAGE and detected by phosphorimaging. The DNA fragments used are 

30 indicated to left side of the data. B. Binding isotherms of EMSA data from A. The 
concentration of bound DNA was calculated and plotted versus the concentration of SarA 
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present in the sample. DNA fragments used: P3(A), P2(H), AC(«), Al/A2(^), 
B1/B2(V). 

Figure 8 depcits EMSA of SarA with entire agr enchancer fragment under 
stoichiometric binding conditions. 
S Figure 9 depicts the initial structure of SarA. Helix 1 (residues 10-30) is very basic 

and likely represents the DNA-binding domain. 

Figure 10 dq)icts the summary of helical arrangement of SarA from preliminary 
structure. The missense and nonsense changes in RN6390 are also indicated. 

Figure 11 depicts the atomic force microscopic (ATM) image of SarA bound to 
10 DNA. The DNA fragment includes the P2 and P3 promoters and the s^roximately 250 bp 
ofRNAmgene. 

Figure 12 dq>ict5 a proposed model of SarA-induced superhelical activation of P3 
promoter. SarA is shown as a dimer fliat can bind to the A, B» and C sites identified by 
Dnase I, as well as to a presumer site downstream of P3. By a mechanism that is not 
IS understood, SarA activates the promote, perhaps by tetramerization. in response to 
quorum establishment. 

Figure 13 depicts the agr locus and DNA fragments for study. The binding sites 
determined by Dnase I foo^rinting are indicated. 

Figure 14 depicts a PNA invading duplex DNA to form a ternary PNA-DNA 
20 complex. 

Figure IS depicts the results of a binding eiq^eriment between die SarA binding site 
A1/A2 and a PNA molecule. 

Figure 16 dq>icts the results of a binding e;q>eriment between die SarA binding site 
A1/A2 and a PNA molecule with and without die addition of SarA. 

2S DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

The present invention combines gen^c, biochemical and structural studies to define the 
mechanism by which SarA controls expression of the genes in the agr locus. The agr locus 
includes agrDCBA, ENASUhld (See Fig. 1). This work, in turn, allows the design and 
synthesis of inhibitors of SarA amd the testing of the effects of these inhibitors of SarA and 
30 its regulated vurulence genes with a major aq;>hasis on the development of novel anti- 
staphylococcal therapeutics to combat the onslaught of drug resistant staphylococcal 
pathogens. 
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The present invention is directed to inhibitors of SarA function involved in the 
egression of staphylococcal virulence genes and a method of treating a staphylococcal 
infection conq;)rising administering to a subject having a staphylococcal infection at least 
one inhibitor of SarA function involved in the expression of staphylococcal virulence genes. 
S The inhibitor is designed to interfere with SarA mediated activation of the agr locus and 
more specifically the inhibitor interferes with the binding of SarA to at least a portion of 
the agr locus. The portion of the agr locus is composed of greater than about 70% 
adenosine-thymidme (A-T) nucleotides, more preferably at least about 75% A-T 
nucleotides, and most preferably between about 79% and 89% A-T nucleotides. More 

10 spedfically, the inhibitor interferes with.the binding of SarA to at least a portion of the agr 
locus dq)icted in Fig. 3. The inhibitor is designed to interfere with the binding of SarA to 
at least one of the portions of the agr locus depicted in Fig. 3 that contains a nucleotide 
sequence selected from the groiq> consisting of at least the nucleotide sequences in the Al 
and A2 boxes, at least the nucleotide sequences in the Bl and B2 boxes and at least the 

IS nucleotide sequences m the CI and C2 boxes. Further, the inhibitor also interferes with the 
binding of SarA to the intervening nucleotide sequences between the Al and A2 boxes 
when the inhibitor bmds to the nucleotide sequence in the Al and A2 boxes, to the 
intervening nucleotide sequences between the Bl and B2 boxes when the inhibitor binds to 
the nucleotide sequences in the Bl and B2 boxes or to the intervening nucleotide sequences 

20 between the CI and C2 boxes when the inhibitor binds to the nucleotide sequences in the 
CI and C2 boxes. The most preferred inhibitors of SarA bindmg bind to at least a portion 
of the agr locus, and the most preferred inhibitors are oligonucleotide analogs that utilize 
the Watson & Crick basepairing to bmd to nucleic acids, such as peptide nucleic acid 
molecules, DNA molecules, RNA molecules, phosphothiolate oligonucleotides, and anti- 

25 sense oligonucleotides. Hairpin polyamides are also preferred inhibitors of ihe present 
invention. Any molecules that can enter a staphylococcus and interfere with the SarA 
function in the expression of staphylococcal virulence factors are encompassed by the 
present invention. Such molecules can be synthesized, tested and identified utilizing the 
methods disclosed in the present invention. The inhibitor is admixed with an acceptable 

30 carrier, such as a pharmaceutically acceptable carrier for administration to the subject 
having a staphylococcal infection. The pharmaceutically acceptable carrier may contam 
preservatives and other non-immunogenic additives, according to methods well known in 
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the art. See, e.g. Rramungton's Pharmaceutical Sciences: Drug Recq>tors and Receptor 
Theory, (1990). The carrier may contain additives that are known to fiacilitate the 
movement of the inhibitor into the staphylococci without adversely affecting the subject 
treated. Sudi additives are known to skilled persons or can be selected using known 
5 methods. These additives faciUtate the movement into the ceU of inhibitors that do nd^ 
readily cross cell membranes, and peptides are known additives that fiacilitate such 
movement across the cell membranes. 

The inhibitors of the present invention are intended for use in treating all 
staphylococcal infections. As discussed above, S. aureus, S. epidemiidis and S. 

.10 saprophyticus are the three disease causing species. It is known that 5. aureus and 5. 

^{die/ir»d£f have homologs of bodi the 5ar and a;r genes. The sequraces of these genes in 
odier species of staphylococci are used to prq>are inhibitors of SarA function in these 
species. IJkewise SarA itselfis used to devdopuihxbitorsti^ 
cannot bind to the agr locus. 

IS The following comparison shows diat the SarA protein from S.aureus and 

iSr.fpid^/7ni<& are quite similar. 

Sara S. aureus MAITKINDCFELLSMVTYADKLKSLIKKEF 

Sara S. epi MAISKINDCFELLAMVTYADRLKGIIKKEF 
SISFEEFAVLTYISENKEKEYYLKDIINHLNYKQPQVVKAVKILSQEDYFDKKRNEHDER 
^ 1 1 M M I I I I I I I I I I I I : I 1 1 I I I i I I I I I 1 I I I I I 1 1 I I I I I I I : I I : I I I I I I I I I 

SISFEEFAVLTYISENKEEEYYLKDIINHLNYKQPQVVKAVKHLSQENYFNKKRIIEHDER 

TVLILVNAQQRKKIESLLSRVNKRITEANNEIEL 
I 1 I t I I : : : 1 i I I I : : i t : I I I : t I I 1 1 I 1 I I : . 
TVL'ILVDSKQRKKI DDLLKRVNNRITEANNENEV 

The present invention also is directed to a method of identifying inhibitors of SarA 
25 function involved in the expression of st^hylococcal virulence genes conq)rising: a) 
contacting a candidate inhibitor with at least one SarA binding site of the agr locus in 
solution to allow the binding reaction to equilibrate for a sufficient period of time; and b) 
assessing the binding of said candidate inhibitor to said SarA bimling site of the agr locus. 
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(See Figure IS for an example of the results of this method.) The method further 
comprises the addition of SarA to the solution of step a) sunultaneously with the inhibitor 
and the SarA binding site, and then an assessment of the binding affinity of the candidate 
inhibitor relative to the binding afiSnity of the SarA to the SarA binding site of the agr 
5 locus. Alternatively, the addition of SarA to the solution of step a) is sequentially with the 
inhibitor and the SarA binding site, and then the assessment of the binding affinity of said 
candidate inhibitor relative to the binding affinity of said SarA to the SarA binding site of 
the agr locus. (See Figure 16 for an exanqile of the results of this method showing the 
results with and without the addition of SarA.) 

10 The assessm^ of bmding or die lack of binding of the mhibitor to the SarA binding 

site and/or of SarA to the SarA binding site is p^formed by an electrophoretic mobility 
shift assay (ENfSA) described herdn and in Rechtin a al, (1999). An additional method 
for assessing the binding of tfie inhibitor and SarA to the SarA binding site is fluorescence 
anisotropy (Nikiforov et al. , 1999) 

15 An inhibitor within the meaning of the present invention should have a Kq of less 

than about 1 nM. However, a lower in the pM range is more preferred. The should 
be low enough for an inhibitor of SarA binding sites to bind to the extent that SarA cannot 
bind to these sites. In an EMSA, as the inhibitor concentration increases, the SarA-SarA 
binding site complex decreases. If the inhibitor is PNA then the DNA (i.e., SarA binding 

20 site)-PNA conoplex increases. If the inhibitor acts on SarA alone and does not bind to the 
SarA binding site, then the free or unbound DNA (SarA binding site) inoreases. (See die 
results in Figure 16) 

Specifically the SarA bmding site of the agr locus that is usefiil m the present 
method is at least one nucleotide sequence selected from die group consisting of at least the 

25 nucleotide sequences in the Al and A2 boxes, at least the nucleotide sequences in the Bl 
and B2 boxes and at least the nucleotide sequences in die CI and C2 boxes as depicted in 
Fig. 3. 

Additionally, the present iavention includes a method of identifying inhibitors of 
SarA function involved in the expression of staphylococcal virulence genes comprising a) 
30 contacting a candidate uihibitor with SarA in solution to allow the candidate inhibitor to 
affect the ability of SarA to bind to at least one SarA binding site of the agr locus; b) 
contactmg the solution of stq) a) with at least one SarA binding site of the agr locus either 
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simultaneously with the contact of the inhibitor and the SarA or subsequently to the contact 
of theinhibitor and the SarA; and c) assessing the inhibition of (he candidate mhibitor on the 
SarA binding to the SarA binding site of the agr locus. This method allows the assessment 
of the inhibitor directly on the SarA rather than the effect of the inhibitor on the SarA 
S binding site. This method uses the assessment of binding of the inhibitor to the SarA 
binding site by an electrophoretic mobility shift assay. The SarA binding site of the agr 
locus used in this method is preferably a nucleotide sequence selected from the group 
consisting of at least the nucleotide sequences in the Al and A2 boxes* at least the 
nucleotide sequences in the Bl and B2 boxes, at least the nucleotide sequences in the CI 

10 and C2 boxes and a combination therof as dq>icted in Fig. 3. 

The present invention targets the expression of genes in the agr locus directly, by 
inhibiting SarA-mediated activation. This approach offers advantages over known 
approaches because SarA is a pleiotropically-actmg regulator thai controls another 
pleiotropically-acting regulator. By inhibiting SarA, agr will be inhibited as well as any 

15 other genes under SarA control {e.g. , collagen adhesin, cna, Gillaspy et al.^ 1998). 

Synthetic molecules that inhibit protein-DNA interactions have been developed and show 
tremendous promise for pharmaceutical applications. These molecules include peptide 
nucleic acid (PNA, Corey, 1996) and hairpin polyamides (HP, White et al., 1998). Both 
PNA and the HP bind dsDNA with very high specificity and affinity. These molecules 

20 have great potential for anti-stq>hylococcal therapies once appropriate targets are identified. 
Molecules like PNAs and HP& that are designed to bhxl in the agr regulatory region and 
inhibit SarA-mediated activation of genes in the agr locus are considered to be inhibitors of 
SarA function within the meaning of the present invration. Because of the high affinity 
(sub-nanomolar K^), very low amounts of the drug or SarA fimction inhibitor needs be used 

75 in a subject in need of treatment. Furthermore, if the subject is to receive a prosthetic* 
device {e.g. , heart valve or h^)), the drug could be used prophylactically to inhibit S. 
aureus colonization. The inhibitor can be administered prior to surgeiy or can be 
inq>regnated in beads for slow release and packed around a prosthetic hip device for a while 
for local administration. Techniques are known for impregnating beads or substrates with 

30 drugs. These beads or substrates may be biodegradable. 

The present mvention is premised upon the determination of the interactions of SarA 
with the P2-P3 regulatory region, die determmation of the mechanism by which SarA 
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regulates virulence gcat expression in Staphylococcus, and the biodiemical characterization 
of SarA and its interaction with DNA. In the present invention, a full-length SarA in £. 
coU was expressed without heterologous fusions and purified to homogeneity. It was 
determined that SarA was a dimer in the presence or absence of DNA and was comprised 
S primarily of a-helices. The combined results of DNasel footprinting and quantitative 
EMSA experiments indicate that three SarA binding sites exist. Two of the footprints 
overlap elements of the P2 and P3 promoters. All of the protected sequences included 
portions of the hqitad re3>eats described by Morfeldt et al. (1996). One SarA dimer was 
found to bmd each binding site with very high affinity. Three dimers bind the entire region 

10 and produce an unusual laddering pattern in electrophoretic mobility shift assays (EMSAs). 

Based in part upon tihe results disclosed in the present plication and the crystal 
structure of SarA, it is believed, but ^licants do not wish to be bound by ttiis theory, that 
SarA utilizes ''indirect readout** in it recognition of DNA biding sites. This mechanism of 
binding site recognition relies on the 3-dimensional structure of the DNA (agr locus) of the 

IS SarA binding site(s) to place the phosphate backbone in a particular configuration. It is this 
configuration that is bound with high affinity by the protein. This binding mechanism is 
similar to the binding site recognition of the trp repressor of E. coli. This type of binding 
mechanism is the opposite of "direct readout" in which the functional moieties in the major 
groove DNA are bound by components of the binding protein. 

20 The following provides the eiqieriments and results that define the specific SarA 

binding sites on the agr locus and Aeir use in producing and selecting ^n^niate inhibitors 
of SarA function, and more particularly, appropriate inhibitors of SarA binding to the agr 
locus. These molecules, preferably PNA and HP, are designed to intemqpt SarA-agr 
interactions. 

23 Examples 

The examples presented below include the following experim^its required as a 
prerequisite to designing and preparing inhbitors that disnqn SarA and agr function: i) 
cloned the sar genes fi'om the main laboratory strain of 5. aureus, as well as 30 clinical 
isolates, ii) expressed the sar ges^ in £. coli and purified the SarA proteins from strains DB 
30 and RN6390, iii) established a quantitative assay for SarA-DNA interactions 
(electrophoretic mobility shift or EMSA), iv) mapped the binding sites for SarA within an 
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important agr regulatory region using DNase I footprinting, v) determined the equilibrium 
binding constants (Kd) for SarA and putative targrt sites in the agr enhancer region, vi) 
determined that SarA is a dimer using chemical cross-linking and dynamic light scattering, 
vii) grew high quality crystals of SarA, in the presence and absence of DNA, for structure 
5 determination, viii) solved structures of the protein alone and of the protein-DNA complex 
to 2.5 and 2.9 angstrom resolution, respectively, and ix) generated preliminary images of 
SarA bound to the agr regulatory region by atomic force microscopy. For most of these 
studies both the DB and RN6390 SarA proteins were used and gave equivalent results. The 
majority of these results are described in (Rechtin et al. , 1999), which is herein 

10 incorporated in its entirety by reference. 

gon inp^ expr ession and purification of SarA 

As reported by Bayer et al. 1996, there are at least two naturally occurring variants 
of die SarA protein inferred from the gene sequence. Strain RN6390 encodes a SarA 
protein that is eleven amino acids shorter than the SarA from strain DB. There is also a 

IS phenylalanine to l^icine substitution at position S3 in the DB protein. 5. aureus 
chromosomal DNA was isolated as described by Smeltzer et al, (1996). 

Synthetic DNA primers were used for PCR an:q)lification that incorporated 
restriction enzyme sites for subsequent cloning in plasmid vector pET9a. The SarA coding 
region was amplified using PCR from S. aureus strain DB with primers incorporating Ndel 

20 and BomHI restriction enzyme sites: 

Ndel S'GGGAGGTTTTACATATGGCAATTACAAAAATCS' 
BcwiHI - 5'GTTTAATAGAATGGATCCTCTATCAAACTTCACC3\ 

The PCR products from reactions using RN6390 and DB chromosomal DNA were 
s^propriately restricted widi Ndel and AimHI and ligated into likewise restricted pET9A 

2S (Novagen) to yield pET-RN and pET-DB plasmids. The fidelity of the constructs was 
confirmed by DNA sequencing. When the optical density of the culture was 0.4 at 600 nm 
at 37''C, IPTG induction (final concentration of ImM) of £. coli LB broth cultures,(£. coti 
strain BL21(DE3)pLys) carrying these expression constructs was performed. Maximal 
expressio occurred within 3 hoiurs of induction. 

30 Cells were collected by centrifiigation, 5,000 x g, 10 min., 4°C and cell pellets were 

frozen at -20^C. Cells were lysed in 50 mM Tris-HCl, pH 7.5, 1 mM DTT. 1 mM EDTA, 
1 mM PMSF on ice. Chromosomal DNA was sheared with ultrasound treatment and the 
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insoluble debris was removed by centrifiigation, 15,000 x g, 30 min., 4T. The cleared 
lysate was brought to 70% saturation with solid ammonium sul&te and the insoluble 
material removed by centrifiigation 15,000 x g, 30 min.* 4*^C. The soluble supernatant was 
twice subjected to dialysis against at least 100 volumes of HSB-150 (HSB = 20 mM Tris- 
5 HCI (pH=7.6), 1 mM EDTA, 1 mM DTT, and 150 mM NaCl). The resultant solution was 
loaded on a heparin-Sepharose column (50 mL bed volume) and washed with at least 300 
ml of HSB-150. Proteins were eluted with a linear gradient of HSB-150 to HSB-1500 
(HSB with L5 M NaCl). Total volume of the gradient was 400 mL. Column fractions 
were analyzed for the presence of SarA using Tricine SDS-PAGE and peak fractions were 
10 pooled. 

A unique protein of sq^roxunatdy 15 kDa (recombinant SarA) was observed by 
SDS-PAGE analysis (Figure 4). Following examination of several standard purification 
s^roaches, a sin:q>le one-day preparation that yields SarA diat is at least 95% pure was 
selected (Figure 4). The purification protocol involves a short mduction time, lysis with 

15 ultrasound, precipitation of most contaminating proteins with ammonium sulfate, and ion 
exchange chromatography. It should be noted that the SarA used is an intact, full-length 
protein, not harboring fusions that may serve to alter the protein's activity. NH2-terminal 
protein sequencing confirmed that the purified protein is SarA. The concentration of SarA 
monomer was determined spectrophotometrically at 280 mn using an extinction coefficient 

20 of 7740 M'\ calculated using the method of Gill and von Hippel (1989) and absorbance at 
280 nm. The DB and RN6390 SarA proteins express equally well and show no dramatic 
differences in tiie purification procedures. Howev^, since the protein from the DB strain is 
more representative of clinically-relevant forms of SarA (discussed below), the remainder 
of the experiments shown below were done with DB SarA, unless mdicated otherwise. 

25 Purified SarA was stored at -20^ and is stable for at least 6, months without 

noticeable loss of activity. Typical yield of SarA was 1-3 mg/liter of culture. 

To determine the activity of the purified SarA stoichiometric binding conditions in 
EMSA with DNA fragment P3 (Figure 3) was used. At DNA concentrations that are very 
high relative to the equilibrium dissociation constant (K^), the amount of protein required to 

30 bind 50% of the available DNA is used to determine the concentration of active protein 
using the equation Kp = [Plm ' l/2[DNA]o, where [P],/2 is the protein concentration at 
50% saturation and [DNAJo is die total DNA concentration (Riggs et al., 1970; Hurlburt 
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and Yanofety, 1990). Under these conditions ([DNA] > 10 nM), Kp is insignificant and 
the equation simplifies to [V]^^ = m\DNA]^. Assuming that SarA binds to the P3 DNA 
fragment as a dimer (confinned below), the activity of SarA in prq>arations prq)ared by the 
disclosed method is routinely 90 - 95% active of the value determined 
5 spectrophotometrically. The concentration of SarA reflects the concentration of active 
protein. 

Oligomeric State of SarA 

It was determined that SarA was a dimer using both chemical cross-linking and 
dynamic light scattering (DLS) and also widi x-ray crystallography as shown below. 

10 Chemical cross-linking of SarA 

For chemical cross-linking, a lO^M solution of SarA monomer was subjected to 
brief treatm»t with any the of the protein oosslinking agents, bis- 
(sulfosuccinimidyl)suberate (BS^), disnccinimidyl sub^rate (DSS) and/or ditfaiobis 
(disuccinimidyl) (DSP) for various times in the absence or presence of agr DNA according 

15 to Maleki et al. (1997). In the case of DSP, DTT was excluded from the buffer. The 
reaction mixture contained 10 SarA and crosslinker in a total reaction volume of 30 
Reactions were allowed to proceed for 1 minute at 4^C and quendied by the addition of 10 
}il of 1 M Tris-Cl (pH=8.0). Denaturing loading dye was added to each reaction and 
incubated at 95X for 20 minutes. Reaction products were analyzed on a 12% Tricme-SDS- 

20 polyaciylamide gel and visualized by Coomassie blue staining. A product consistent with a 
SarA dimer was observed in Figure SA. A dimer is the only oligomer d^ected und^ any 
conditions tested. The presence of agr DNA had no effect on the cross-linking results. 
The DB and RN6390 SarA proteins behaved identically in this assay. 
DvTi aTnic T ig ht Scattering 

25 The oligomerization state of purified SarA was examined in the presence and absence of 10 
mM MgCls by dynamic light scattering (DLS) using a 2001 DynaPro Dynamic Light 
Scattering Instrument and analysis software, DYNAMICS, version 3.30. DLS reveals the 
homogeneity and oligomeric state of proteins in solution based on diffraction of visible 
light. Two sets of measurements were made widi protein in the absence of MgCIa- In the 

30 first, SarA was concentrated to 3.3 mg/mL (230 fiM), which is necessary to provide the 
experimental signal, and in the latter to 11.7 mg/mL (813 fiM), which is the concentration 
used m thecrystallization experiments (described below). It is inq>ortant to note that the 
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chemical cross-linldiig experiments were performed at much lower concentratioiis of SarA, 
concentratioiis at which DNA is coiKq)letely bound in EMSA experimrats. In eadi 
experiment, SarA (3.3 mg/ml) was buffered by solutions containing SO mM Tris-HCl, pH 
7.5; 2mM DTT; 500 mM NaCl; ImM EDTA. Twenty measurements were taken at 2TC 
5 for each analysis. The bimodal analysis of the scattering revealed a monodisperse solution 
with a macromolecular weight of 34 kDa. This is consistent with the scattering from a 
slighdy elongated SarA dimer (calculated MW of 29.4 kDa). Interestingly, in the presence 
of 10 mM MgClj, the bimodal analysis of the 11.7 mg/mL SarA solution revealed a 
monodisperse solution with a macromolecular species of molecular weight S6 kDa. This 

10 suggests a divalent metal ion-dependent tetramerization and more globular fold to the 
tetrameric species. Taking die cross-linking and DLS e3q>eriments togetfan. SarA appears 
to be a stable dimer, which can form tetramers at U^gb protein concentrations and in the 
presence of Mg. Tetramer formation may be a critical activity in the r^[ulation of agr 
operons by SarA. 

15 Determination of the activitv of recombinant SarA 

With the oligomeric state of SarA known to be a dimer and the binding sites further 
defined by DNase I footprinting, stoichiometric binding analysis was performed to 
determine how active the E. coli expressed, pure preparation of SarA was and die 
stoichiometry of SarA-DNA complexes. Using synthetic DNAs (A1/A2 or B1/B2 shown in 

20 Figure 3) as targ^ in EMSA, it was d^rmined that recombinant SarA is approximately 
95% active, which correspomls well with die level of purity observed in SDS-PAGE. These 
experiments were carried out by titrating a concentration of ^-labeled DNA that is 100- 
fold higher dian the q)parent Kq with SarA (Riggs et al., 1970; Hurlburt and Yanofsky, 
1990). The amount of DNA pres^ in the shifted and unshifted bands was determined by 

25 phosphorimagmg. The concentration of protein required to shift 50% of die DNA was used 
to determine the stoichiometry and activity from the equation: Kq = [SarA],/2 - 1/2[DNA] 
which is derived from the equilibrium expression. Under conditions where the DNA 
concentration is very high relative to the K^, the equation becomes [SarAJiyj = 1/2[DNA]. 
Since the concentration of the synthetic DNA fr^igment is known one can accurately 

30 calculate the activity of the DNA-binding protein. 
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Detenninatioii of SarA secondary structure usinp circular didiroism 

The protein concentration utilized for this study was 1.8 mg/nd in a solution of SO 
mM potassium phosphate, pH 7.S. The circular dichroism spectrum was taken from 260 
nm to 180 nm. The spectrum showed large negative ellipticity at 208 nm and 220 nm 
5 indicative of a high helical content (Rechtin et al., 1999). Using the spectrum from 260 nm 
to 190 nm, the percentage of each secondary structure element was calculated. The results 
were: Helix - 54.9%, Strands - 6.3%, Turns - 14.5% and Random coil - 26.7%. 
DNase I footprinting of SarA binding to the putative aer regulatory region 

Based on die work of Morfeldt et al. (1996), the 130 bp that span the -35 regions of 

10 the P2 and P3 promoters of the genes of the agr locus contain target binding sites for SarA 
(Figure 2). This work further indicates that a DNA fragment containing 50 bp upstream of 
P3, in the context of the intact RNAm gene is sufficient for qypropriate, Sarnmediated 
ei^ression of RNAIII in S. aureus. The four h^tad rqieats (indicated in Figure 2) 
divergently arranged within the 130 bp region were considered to be important and that 

15 notion was supported by qualitative EMSA data in that work. To test this notion and 
identify binding sites for SarA in this region, DNase I footprinting was perfomred with the 
purified SarA. The target DNA for DNase I foo^rinting was a fragment of plasmid 
pBKH50, which contains the P2 promoter, the P3 promoter and the intervening DNA. ^P- 
end-labeled DNA tai^)lates (40 pM) were titrated with DNase I to establish the conditions 

20 for one cleavage per molecule^ then those conditions were used with a series of SarA 
concentrations. The data were quantified by phosphonmaging. A representative DNase I 
footprinting gel is shown in Figure 6A. The other strand of DNA was labeled and used in 
fbotprintmg experiments, showing essentially the same results (data not shown). From this 
analysis, protected regions are obvious at very low concentrations of SarA (marked Al, 

25 A2, Bl, B2, CI and C2 in Figure 6A). The footprinted regions were protected by more 
than 50% in the presence of 3 - 6 pM SarA and do not correspond to the divergent hq)tad 
repeats to the two promoters. Thus, the interactions of SarA with this DNA are of very 
high affinity, an observation that is confirmed by the EMSA data (below). Furthermore, 
using EMSA under stoichiometric conditions, it is shown that each pair of footprints is 

30 bound by one SarA dimer and that the entire region is bound by three SarAs (data not 
shown). The sequences of the fooQ>rinted regions were aligned and a consensus derived 
using programs resident in the GCG software package (Figure 6B). A summary of these 
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data is presented in Figure 3. The bold boxes indicate the foo^rinted regions, whidi are of 
uniform size and spacing of half sites. It should be noted that the A1/A2 site lies con^letely 
within a DNA fragment shown by Morfeldt et al. (1996) (see Figure 2) to be sufficient for 
SarA-mediated regulation of RNAm expression (pEXOSS, see Figure 3). DNA sequences 
5 with high homology have also been identified downstream of the P3 promoter in the 
RNAm gene. 

Specifically, the conditions for performing the Dnasel fooq)rinting methods are 
disclosed in Rechtin et al. (1999), which is herein incorporated by reference in its entirety. 
Briefly, the region of DNA (shown in Figure 3) was cloned from the genomic DNA of the 

10 S, aureus strain DB by PGR anq[)lification. The primm contained BamHl sites and the 
PGR product was cloned into the BamHl site of the plasmid, pUCllS, to form pBKHSO. A 
240 bp DNA fragment contaming the P2-P3 promoter region was PGR amplified from the 
plasmid pBKHSO with die ^-end-labeled primers, -40 and M13 reverse. SarA was 
allowed to bind to the DNA m a reaction mbcture contammg 40 pM of ^-labeled DNA, 10 

15 mM Hepes (pH 7.6), 5 mM MgClj, 1 mM CaClj, 1 mM DTT, and 100 mM KCl at room 
temperature for 30 minutes. An amount of DNase I, which produced approximately 50% 
non-nicked DNA, was added to the reaction mixture and incubated for 2 minutes. The 
reaction was quenched by addition of stop solution (80% EtOH. tRNA (1 ug/ml), 0.3 M 
NH4OAC) and immediately put mto a dry ice/EtOH bath for 30 minutes. A visible pellet 

20 was obtained after centrifiigation and washed with 70% ^hanol. Following denaturing gel 
electrophoresis, the DNA fragmrats were detected and quantified by phosphoimaging* The 
radioactivity in the bands for samples containing SarA was subtracted from the no protein 
control using Microsoft Excel. The concentration of SarA required to protect a cleavage 
site fiilly was detmnined. To identify the regions of protection within the inter-promoter 

25 region, purine sequencing was mcluded m the analysis. Chemical sequencing was done 
according to standard procedures. 

Determination of the IC> values for SarA and putative binding sites 

To determine the affinity of SarA for the putative binding sites, quantitative EMSA was 
performed with synthetic DNA fragments shown in Figure 3 (A1/A2, B1/B2, P2, P3 and 
30 AC). A1/A2 and B1/B2 were identified in DNase I footprinting experiments (Rechtin et al^ 
1999). P2 and P3 were proposed by Morfeldt et al. (1996) and AC was proposed by Chien 
and Cheung (1998). In the latter publication, the AC fragment was used to test the 
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iiiq)oitaiice of the footprint rq)orted. The oligonucleotides were synthesized, end-labelled 
with and purified after denaturing polyaciylamide gel electrophoresis (PAGE). 
Complimentary DNAs were annelaed» purified by native gel electrophoresis and used in the 
binding experiments. Quantitative EMS A was performed with these synthetic DNA 
S fragments to determine both the stoichiometiy of SarA binding and equilibrium dissociation 
constants (Kq). DNA concentrations greater than lOnM were used for stoichiometric 
binding analysis. For experiments determining values, concentration of labeled DNA 
used was ejctremely low (< 10 pM). Various concentrations of SarA were incubated in a 
20^1 reaction buffer with ^P-labeled DNA and m buffer containmg IQmM HEPES, pH7.6, 

10 ImM EDTA, 2mM DTT, SO mM KQ» 0.05% Triton X-100 and S%glycerol. Binding 
reactions were allowed to equilibrate for 30 minutes before electrophoresis. Bound products 
were s^arated from free DNA on 6% native polyaciylamide (50:1 aciylamide- 
bisacrylamide) in 0.5x Tris borate-EDTA. Gels were run at 200 V, and tenq)erature was 
maintained at 16^C with a circulating water bath. Resolved gels were dried and the 

IS products quantified by phosphorimaging. The amount of bound DNA was calculated from 
the reduction in the unbound DNA and plotted against the respective SarA concentrations. 

A limiting amount of ^-labeled DNA was titrated with SarA, displayed on native gels 
and quantified by phosphorimaging. Under these conditions, the equation K,, = [SarA],/2 - 
1/2[DNA] sinq>lifies to = [SarA],/2, where the K|> is equal to the concentration of SarA 

20 required for 50% complex formation. In addition, the data were fit and AG values derived 
using the biological equation splver^ BIOEQS (Royer et al.^ 1990; Royer and Beechem, 
1992; Royer, 1993). Representative autoradiograms and the derived binding isotherms are 
shown in Figure 7A and 7B, respectively, and die data are sununarized in Table I, below. 
The DNAs corresponding to footprints A1/A2 and B1/B2 were bound with the highest 

25 affinity (K^ = 7 - 10 pM). The P2 and P3 DNAs were bound with significantly lower 
afiinity (K^ = 200 pM) and an oligo corresponding to the region revealed by Chien and 
Cheung (1998) was bound with relatively low affinity (Kj> = 1 nM), The affinity of SarA 
binding these DNAs correlated with the number of half sites present. A1/A2 and B1/B2 
have two intact half sites. P2 and P3 have 1.5 half sites and AC has one half site. Each 

30 DNA fragment was tested by stoichiometric EMSA and found to be bound 1:1 by SarA 
(data not shown). 
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Table I. 
Simmiary of Binding Constants 



oligo 


size 


half-sites 


AG of dissociation 
(kcal/mole) 




protdn:DNA* 


P3 


45 


1.5 


13.0 


220 pM (1.0) 


1:1 


P2 


45 


1.5 


13.0 


220 pM (1.0) 


1:1 


AC 


45 


1.0 


12.1 


InM(l.O) 


1:1 


A1/A2 


59 


2.0 


14.8 


10 pM (2.0) 


1:1 


B1/B2 


59 


2.0 


15.0 


7 pM (2.0) 


1:1 



''Number in parentheses represents cooperativity values, p values, from 
analysis of binding curves 
5 ^ Ratio of SarA dimer per mole of DNA 



Interaction of SarA widi flie entire P2-P3 region 

From the DNase I footprinting and EMSA results, it is clear that SarA binds multiply to 
the entire agr enhancer region. EMSA presented in Morfeldt ei al. (1996) showed a ladder 
pattern when the entire region was used as a target which indicates multiple protein 

10 binding. Since those data were produced with S. aureus extracts, the present invention 
tested the recombinant SarA for this activity. Figure 8 shows a representative EMSA of a 
DNA fragment corresponding to the entire agr enhancer region performed under 
stoichiometric binding conditions (200 pM). The same ladder pattern is evident. The 
sin^lest interpretation of these data is that an increasing number SarA proteins are binding 

IS to the DNA fragment and causmg the increased r^ardation of the conQ)lexes in EMSA. In 
odier words, SarA first binds to die highest affinity sites and the resultant shifted DNA is 
indicated by 1 in Figure 8. Increasing the SarA concCTtration results in additional dimers 
binding until the entire region is saturated. This simple interpretation leads to the 
conclusion tiiat greater than 9 SarA duners are binding this large fragment of DNA. 

20 However, since the ratio of SarA to DNA is 3:1 at the lowest concentration of protein 
required to completely shift all of the DNA, we consider that conclusion to be highly 
unlikely. Rather, it is believed that some of the bands shown in Figure 8 result from 
conformers with the same stoichiometry. In other words, SarA bound to the A1/A2 site 
may result in the band indicated by 1 in Figure 8, whereas SarA bound to the BI/B2 site 

25 may yield band 2, ete. The different mobilities would result from subtie differences in the 
DNA conformation when SarA is bound at diffident sites. This interpretation is reasonable 
since the binding sites are observed in DNase I footprinting analysis at low concentrations 
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of SarA. In addition, electrcq^horetic mobility changes resulting from conformational 

changes in protein-DNA conq>lexes are well established (reviewed in Lane et al.^ 1992). 

Fimhomore, quantitative analysis of the data indicates that there is no dramatic positive 

cooperativity to the tandem binding events. The results of our EMS A with small DNA 
S firagments shown in Figure 7 also indicate multiple SarA-DNA complexes, but only when 

two intact half sites are present. It is believed that Sar A binds to two half sites and can alter 

diat conformation of the DNA between those sites. 

Crvstallization and prelim inar Y structural characterization of S, aureus SarA 

Data-quality crystals of SarA have been grown by the vapor diffusion method in 
10 hanging drops (McPherson, 1990). Specifically, SarA was concentrated to 20 mg/mL in 20 

mM Tris-HCU pH 7.5, ISO mM NaCl, 1 mM DTT and I mM EDTA and mixed 1:1 (v:v) 

witii tiie crystallization reagoit (5% isopropanol, 10 mM MgC!,, SO mM Tris-HQ pH 7.6). 

This solution is then equilibrated over a 1 ml reservoir of die crystallization reag^ at 4°C. 

The initial volume of the crystallization drop is 20 /iL. SarA crystals grow relatively 
IS slowly, appearing after two weeks. The crystals can contmue their growth for an additional 

2 months. To date, the largest measured crystal dimensions are 0.7 mm x 0.2mm x O.OS 

mm. 

Crystallography 

Small angle (5^ precession photography of the hkO and Okl zones reveals the mmm 
20 synmietry of the crystal and the presence of three 2, screw axes (later confirmed by 
intensity data collectioii). Analysis of these photogn^hs yield unit cell dimrasions of a = 
89.2 A, b = 14S.7 A, c = 4S.8 A and a== p = y= 90^. The space group is P2,2i2,. Witii 
4 equivalent positions per unit cell, it is likely tiiat thm is a tetramer of SarA in the 
asymmetric unit. The is 2.4 A^/Da. A high resolution data set for this SarA crystal form 
2S (1 .7A) was collected at the Stanford Syndirotron Radiation Source. A preliminary structure 
of SarA a carbon backbone is shown in Figure 9. SarA is completely a-helical, as expected 
from the circular dichroism data (summarized in Figure 10). The first helices are the likely 
DNA-binding domains. This notion is based on a high density of basic anuno acids in the 
C-terminal half of helix 1 (K21, K23, K27 and K28) and electron density being visible in 
30 this region in the crystals of the SarA-DNA complexes. To accommodate binding two 
segments of DNA that are separated by two turns of die DNA helix, the DNA must not 
sinq>ly lie in the basic region hole (pnpendicular to the plane of the page). Rather it is 
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more likely that helix 1 from each subunit lies in the major groove of the binding sites and 
the DNA is more parallel to the plane of the page. The DNA between the mapped binding 
sites must be severely bent in order to r^urn to the protein and interact with the N-terminal 
helix of the other subunit. It should be noted that this structure is not homologous to any 
5 previously described transcription factor. 

Crvstallization and preli minary st ructural characterization of S. aureus SarA-DNA complex 
Initial attempts to obtain SarA-DNA co-crystals utilized oligonucleotides based on the 
hq)tad repeats reported by Morfeldt et al, (1996). Several DNAs were used in the attempts, 
ranging from 7 - 21 bp in length. Data quality crystals were grown that diffract to 2.S 

10 angstroms. The crystals take the space group P2„ with a = 54.5 A, b = 65.2 A, c = 57.8 
A and p = 118.0^. An initial three dimensional x-ray intensity data set has beoi collected 
from a small crystal (smallest physical dimension < 0.02 mm) with an R-Axis IV imaging 
plate area detector at ambieut tenq)erature {IVC). X-rays woe generated using a Rigaku 
RU-300 rotating anode x-ray generator, which equipped with Yale focusing mirrors, set at 

15 50 kV, 100 mA. An automatic indexing routine confirmed the cell constants as calculated 
from small angle precession photographs. Intensity data were processed with Biotex (MSC, 
Woodlands, TX). Although diffiraction was observed beyond 2.7 A, the data were 
processed conservatively because of the n^id x-ray induced crystal decay. The Rmc^* for 
data in the 30.0 A to 3.0 A resolution range (41,345 observations) was 8.2% for all 

20 reflections, 7.6% for full reflections, and 9.2% for partial reflections. The final merged file 
contains 7,157 unique reflections and is 75% conQ>lete. (^Rmo^b = 100x(£I^-Io/£ 
W). 

Visualization of SarA complexes with the apr region using atomic force microscopy 

To establish conditions for observing SarA-DNA conq)lexes dkectiy, AFM was 

25 used with sanq>les of SarA and a 400 bp DNA fragment containing the P2 promoter, the P3 
promoter and most of the RNAm gene. This DNA fragment contains all of the SarA 
binding sites identified by DNase I footprinting and from sequence analysis of the RNAUI 
gene. Briefly, a mixture of 10 nM SarA and 1 nM DNA fragment in 10 mM NaP04 pH 
7.4, 50 mM KCl, 10 mM MgCl2 was deposited on a freshly cleaved mica disk (1 cm 

30 diameter), washed with water and dried for 20' prior to imaging. A Nanoscope m 
mioroscope in tapping mode was used to image the molecules on the surface of the mica at 
1.798 volts, at a scan rate of 2.035 Hz. The image shown m Figure 11 is representative of 
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Other complexes preseDt on the surfece. It spears that SarA binds to at least 3-4 positions 
in the DNA fragment* at both ends, and approximately 1/3 of the way from one end. The 
largest complex (top right) may represent tandCTily bound SarA dimers. The sin^lest 
interpretation of this data is that the Sar A-DNA complex at the bottom of the image is cis to 
S the P2 promoter (C1/C2 site), the Sar-DNA complex 1/3 of the length of the fragment in is 
cw to the P3 promoter (A1/A2 site) and the large SarA DNA complex at the upper right is 
within RNAm. It is not clear if the B1/B2 site was occupied in this image. 
Worldnp model for SarA regulation of the genes in the apr locus 

Based on the data obtained and reports from other studies, a worldng model of SarA- 

10 agr interactions was generated. In the model, SarA binds to DNA and increases the 
siq>erlielical turns of nearby promote DNA (schematirally depicted in Figure 12). in the 
case of agr^ this activates expression, in^lidt in this model are the presence of SarA 
binding sites downstream of die regulated promoter as well as iqpstream. These sites are 
predicted by sequence analysis and tentatively observed in Ae AFM stu^. In the worldng 

IS model, dimeric SarA binds to a DNA target upstream of P3 and another dimer binds 
downstream, of P3. The dimers would associate to form tetramers by an presently unknown 
mechanism {e.g., post-translational modification or small molecule effectors). The 
intervening DNA is topologically altered, specifically, it is over-wound. Over-winding 
would place the -35 and -10 regions of the P3 promoter closer together and allow higher 

20 affinity binding by RNA polymerase. 

The following observations are accounted for by this model: 

1. SO bp upstream of P3 is necessary for SarA-mediated regulation of RNAm (pEXOSS in 
Figure 7 above, and Morfeldt ei al.^ 1996). SarA is a dimer, which binds to the SO bp 
DNA fragment containing the A1/A2 site. SarA dimers can associate and form 

25 tetramers under appropriate conditions (Schumacher, Hurlburt and Brennan, 
unpublished). 

2. the -10 and -35 regions of the P3 promoter are 3 bp too far qiart for highly efficient 
binding by RNA polymerase. 

3. deletion of 3 bp from the P3 promoter results in constitutive, SsnA-independent 
30 expression (Morfeldt et al. , 1996). 

4. There are regions downstream of the P3 (and P2 promoters) that are homologous to the 
highest affinity SarA binding sites (the present invaotion) and preliminary AFM shows 
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SarA binding downstream of P3 (the presrat invention). 

5. SarA is expressed constitutively Aroughout die exponential growdi phase and into post- 
e^netial stages (Blevins et al., 1999). 

6. SarA, expressed in E. coU (unmodified or un-affected by small molecule effectors) can 
5 bind DNA sites with very high affinity (Rechtin et al, 1999). 

7. Although not widespread, precedents exist for superhelical control of promoter activity 
(e.^., MerR of E. coli, Ansari et al,^ 1992; Ansari et al., 1995). 

8. Commensurate with activation of agr genes, SarA represses the collagen adhesin gene 
{cna; Giilaspy et al.y 1998; Blevins et al.^ 1999). The cna promoter has perfect spacing 

10 between the -10 and -35 regions. Binding of SarA and subsequent overwinding of the cna 
promoter would reducetfae afGnity of RNAP and reduce expression. MoR can act either 
as a rqiressor and an activator, dependent on its induced DNA topology dianges (Ansaii 
era/., 1995). 

Homology of SarA and aer regions in clinical isolates 

15 The present invention is based on the determination of the medianism of SarA 
regulation of the two operons in the agr locus and developing inhibitors of that regulation. 
Based on the strain-to-strain variation observed by Novick and colleagues (Ji et al., 1997), 
it was a concern that the inhibitors would not work on all clinically-relevant strains. To 
address this issue, the SarA structural gene and agr enhancer region from thirty genetically 

20 distinct strains of Staphylococcus aureus ^ most of which are clinical isolates, were cloned 
and sequenced (Smeltzer et al., 1996; Smeltz^ et al.^ 1997). Briefly, the regions of 
mterest were an^)lified by PGR and cloned into Ae pTOIK>-CR2.1 vector. Several 
mdependent clones were sequenced (both strands) from each strain and compared to strain 
DB for differences. With regard to the SarA coding region, the only changes were observed 

25 in strain RN6390: all of the clinical isolates were identical to DB. The agr »hanca: region 
was identical m all stnuns tested. This result is very important as it supports the utility of 
the present method of treatment against a broad scope of staphylococcal strains and 
supports the effectiveness of an inhibitor of SarA in the clinical setting. 
Sunmiarv of experiments studving the nature of the SarA binding site in the agr repion 

30 The interaction of purified SarA with the DNA region between the divergent promoters 
for the agr operons, P2 and P3 was biochemically characterized. In DNase I analysis, 
three sets of SarA-dependent footprints were detected with the highest affinity sites, Bl and 
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B2, in the inter-promoter region more proximal to the P2 promoter. Two pairs of 
foo^irints with slightly lower affinity for SarA, A and C, that overlapped to Ae -35 
positions of die P2 and P3 promoters were also observed. Determination of binding 
stoichiometry by EMSA of the entire inter-promoter region revealed a SarA:DNA ratio of 
5 3:1, indicating that each pair of binding sites was conq)rised of two half-sites. It is 
interesting to note that the half sites are two turns of the DNA helix apart, in contrast with 
the binding sites for many other well-studied prokaryotic regulatory proteins, e.g.^ trp 
repressor, X repressor, cro, lac repressor, which are one turn of the helix s^art. The 
binding sites determined by our DNase I footprintmg do not overlap all the heptad repeated 

10 sequences observed by Morfeldt et al. (1996). However, the Al and A2 sequences are 
within the construct pEXOSS, which were rq)orted to contain die minimal sequence 
required for SarA-dq>endent transcription of RNAm (see Figure 3). These results do not 
concur completely widi Chien and Cheung (1998). The discrqiancies b^een these results 
and those of (Cheung's group can most easily be explamed by the nature of die recombinant 

IS SarA protein. Whereas, the characterized SarA is expressed as a full-length, unmodified 
form, Cheng's SarA was expressed as a N-terminal GST fusion. It has been determined 
that the N-terminus is very sensitive to modifications. 

Quantitative EMSA was used to test the affinities and stoichiometry of SarA for the 
sites described above, as well as sites proposed by others using synthetic DNA fragments. 

20 The values derived from analysis of the results show SarA binding affinity was the 
highest for DNAs containing intact binding sites and decreased when only partial sites were 
preset. The stoichiometry of SarA binding to all the DNA fragments we tested was 1:L 
The Kd values detennined in this study are dramatically lower than those rq)orted by CMen 
and Cheung (1998). The results presented of SarA interactions with the agr region shows 

25 that there is a stable dimer of SarA binding prefemtially to three specific sites, each 
containing two half-sites, upstream of the regulated P2 and P3 promoters. Only one half 
site (Bl) is contained within the DNasel foo^rint rq)orted by Chien and Cheung (1998). 

Whereas a consensus site was derived from the six half sites, the optimal sequence 
for a SarA binding site is still not obvious. Identification and characterization of other sites 

30 in genes that are directly regulated by SarA, e.^., oia, (Gillaspy et ^z/.,1997; Blevins et al.^ 
1999) will contribute to this knowledge. Significant homology has been observed among 
sequences cis to genes with altered expression in SarA mutants (e.g., tst, spa, hlb, seb, hia) 
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with the binding site described by Chien ai^ Cheung, whidi contains the Bl half site (Chan 
and Foster, 1998). However, none of these genes has been shown to be dkectly regulated 
by SarA. A common theme among the SarA protected sequences described in the present 
invention is AT-abundance (79 to 89%). 
S The miilti-band pattern of SarA-DNA con^Iexes observed in EMSA experiments 

with the larger fragments of DNA are similar using SarA in extracts of S. aureus (M orfeldt 
et al., 1996) and recombinant, purified SarA. These multiple bands are only present when 
the DNA fragment used m EMSA contains at least two intact half sites. As mentioned 
above, a sinq)le interpretation of this phenomenon would have different SarArDNA 

10 conformations utilizing different sets of half sites. However, sevoul lines of evidoice lead 
to a more complicated, yet interesting model, namely fliat SarA induces dianges m the 
supolielicity of the DNA fragment (originally proposed by Morfisldt et at., 1996). First, 
the spacing between die -10 and -35 regions of the P2 and P3 promoters is sqq>rox]mately 3 
bp too far. Deletion of 3 bp from die P3 promoter resulted in constitutive, SarA- 

15 independent expression of RNAIU (Morfeldt et al. , 1996). Thus, one effect of SarA is to 
overcome this spacing. Next, the EMSA with the A1/A2 and B1/B2 fragment shows two 
band shifts occurring simultaneously, although the protein:DNA ratio is 1:1 and only two 
half sites are present. These two bands might correspond to differences in structure due to 
SarA binding an AT-^rich sequence and changing the DNA conformation. Lasdy, the gene 

20 encodmg the collagen adhesm (cna) is rq>ressed by SarA in late stage cultures, the time 
when SarA activates the P2 and P3 prmnoters (Gillaspy et a/.,1997; Blevins et al., 1999). 
The spacing of -10 and -35 regions for die cna promoter is nearly optunal. Thus, SarA 
bindmg may serve to over-wind regulated promoters acting as both a repressor and an 
activator. Precedence for modification of DNA structure has been clearly shown for the 

25 MerR protein in E. cott (Ansari et al.^ 1992; Ansari et al., 1995). One requirement of 
such a model may be that SarA binds downstream of the regulated promoter as well as 
upstream. There is preliminary data indicating the existence of multiple sites located 
downstream of the P3 promoter. 

Analysis of the cis regulatorv elements in the pgr locus responsive to SarA control 
30 As described above, and in conjunction with the work of other groups, good evidence 
for cis regulatory elements m the region between the P2 and P3 promoters have been 
provided. Footprinting revealed high affinity bmding sites, which were corroborated by 
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quantitative EMSA analysis. In addition, the left balf of this region was sufficient for 
appropriate expression and regulation of RNAm (Morfeldt a al.^ 1996). However, several 
lines of evidence indicate the possible existence of cis regulatory elements downstream of 
the P3 promoter, in RNAm (see model above): 1) del^on of three base pairs between the - 
5 10 and -35 regions of the P3 promoter yielded Sar-independent, constitutive expression of 
RNAm, which suggests topological changes, 2) the EMSA of this invention indicate that 
major structural changes occur to DNA when it is bound by SarA, 3) sequence analysis has 
revealed the presence of potential binding sites for SarA downstream of P3, and 4) 
preliminary AFM shows SarA binding downstream of P3. To affect the topology of the P3 

10 promoter, SarA appears to have binding sites on both sides of the promoter. The existence 
and significance of diese potential sites and (he as elements already identified upstream of 
P3 are examined by the use of a combination of fiootprinting, mutagenesis* in vivo 
expression and in vitro binding assays. Ftohermore, DNA fragments carrying mutations in 
the SarA binding sites and atomic force microscopy are useful to observe changes in the 

IS binding patterns directly. 

In Figure 13 the P2/P3 region is dq)icted. Previously presented DNase I foo^rinting 
experiments focused on the region between the P2 and P3 promoters (DNA fragment A in 
Figure 13). The fooqprinting analysis on a DNA fragment containing both the upstream 
and putative downstream binding sites (Fragmrat B, Figure 13) is used to test the 

20 possibility of SarA binding sites existing downstream of P3. The DNA fragment (B) is 
anq>lified from plasmid pT7B-agrl2» which contains the oitire RNAm gene and upstream 
region to P2 (depicted as fragment C in Figure 13). The DNase I fooQirinting is performed 
exactly as it was done for fragment A (Reditm a a/, 1999). Briefly, one oligonucleotide 
primer is labeled with ^ using polynucleotide kinase and gel purified. This primer and its 

25 unlabeled ''opposite'* primer are used in limited PCR (ca. 20 cycles) with the fragment B 
clone to generate the end-labeled target DNA for foo^rinting. The concentration of DNase 
I required for 1 cleavage/DNA fragment is detennined empirically and various 
concentrations of SarA are added and footprinting assays performed. The samples are 
resolved on sequencing gels, dried and quantified by phosphorimaging. Radioactive traces 

30 of the Sar A-containing lanes are digitally subtracted from that in the control (no SarA) lane 
using Excel (Microsoft). The concentration of SarA required for a 50% diminution of 
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DNase I cleavage is determined. The fiootprinting technique for SarA already has been 
optimized as disclosed in Rechtin et al^ 1999. 

The presence of SarA-dependent foo^rints downstream of P3 shows that the model 
presented above is correct. The binding sites that have already been identified (A1/A2, 
5 B1/B2 or C1/C2 in Figure 3) may be sufficient for SarA-mediated regulation alone. It is 
important to remember that the experiments described by Morfeldt et al. (1996) showing 
that a DNA fragment approximately SO bp upstream of P3 promoter was sufficient for 
appropriate expression and regulation, contained an intact RNAUI gene. To test if only the 
iq)stream DNA is critical, a rq)orter gene fusion (xy/!E) to the P3 promoter is made and 

10 tested in S. aureus. 

Putative d^-acting regulatory elements are tested by mutagenesis and northern blot 
analysis. Plasmid pT7B-agrl2 (described above) is used as the templafft for mutagenesis 
studies. Briefly, the putative SarA binding sites identified by our previous DNase I 
foo^rinting studies, as well as other putative binding sites identified from the foo^rinting 

IS experiments described above are tested. Mutations from A — > T and T -> A or C — > 
G, G - > C (to not affect GC content) are made in the putative binding sites. Initially. 2-3 
changes are introduced per experiment in order to be confident that the site has been 
disrupted. Synthetic oligonucleotides with the desired changed were synthesized and used to 
prime second strand synthesis of a single stranded DNA ten^iate. Recovery of mutants is 

20 enhanced by using the thiol-nucleotide system for ttie mutagenesis (Amersham) (e.g.. 
Mackintosh et al., 1998). The fidelity of the mutants are determined by sequence analysis. 
Mutants in pT7B-agrl2 are used directiy for DNase I fbotprinting. Mutants for expression 
in S. aureus are cloned into vector pCL84. Following electroporation into E. coU^ 
transformants are selected on tryptic soy agar containing spectinomycin (SO ^g/ml). 

2S Testing by DNase I footprinting is acconq>lished as described above. 

To test the effect of altering genetic elements as to P3, pCL84 clones carrying different 
mutants are purified from coli and integrated into the chromosome of S. aureus strain 
CYL316. CYL316 is an RN4220 strain diat contains a plasmid (pUL112A19) carrying the 
phage LS4a integrase {int) gene. The presence of the integrase gene guides the integration 

30 of pCLM, which includes the LS4a an site but cannot replicate in S. aureus^ into the S. 
aureus lipase {geh) gene. After transformation of CYL316 with dififeroit pCL84 clones, 
integrants are selected on medium containing tetracycline (3 fig/nol) and confirmed by lipase 
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assay (all integrants will be lipase negative) and Southern blot with a geh probe. Lipast 
production is assessed by plating on Jl agar (Dickinson et al., 1968) and using a 
quantitative spectrophotometric assay developed by the Co-I (Smeltzer et al.^ 1992). 

After confirmation of C7YL316 integrants, each mutant is transduced (see below) into 
5 WA400 and an KSAHUsar double mutant. WA400 is a derivative of S. aureus strain 8325- 
4 in which the region encoding RNAUI has been replaced with a chloramphenicol resistance 
marker. The use of WA400 eliminated the possibility that RNAUI can be expressed from 
any source other than the recombinant plasmid introduced into geh. Also, while a direct 
interaction between SarA and components of the agr regulatory system has not been 

10 dmonstrated, the use of WA400 (rather than the a^r-null mutant RN6911) »sures that any 
undefined interaction between SarA and proteins CTCoded within the agrBCDA operon will 
remain possible. As a negative control, plasmids are introduced into an ttNAHUsar double 
mutant generated by transducing the diloranqihenicol-TCsistance gene from WA400 into 
PC1839 (kindly provided by Dr. Simon Foster, University of Sheffield, Sheffield, UK). 

IS PC1839 is defined by the insertion of a kanamycui-resistance marker into the sarA gene 
(Chan and Foster, 1998). Transductions are done using phage Oil as previously described 
(Gillaspy et al., 1998).. Transductants are screened by plating on appropriate selective 
medium (tetracycline in the case of the pCL84 integrants and chloramphenicol in the case of 
the WA400 RNAm mutation). After demonstrating the presence of the non-selected 

20 markers {e.g. kanamycin in the case of PC 1839), transductants are confirmed by lipase 
assay (see above) and by Southmi blot using probes for the geh, sar and agr loci (Blevins 
era/., 1999). 

After each mutant was established in WA400 and in the RNAUI/f or double mutant, the 
efiTect of altering genetic elements as to P3 on the SarA-mediated activation of RNAm 

25 transcription is assessed by Northern blot using probes corresponding to sarA, RNAUI and 
the polycistronic RNAU message derived from the agrBCDA operon. Northern blots are 
quantified by phosphoimaging analysis. To ensiu*e that subtle deferences were detected, the 
results are normalized by comparison to the results obtained with a probe for a 
constitutively expressed 16S ribosomal RNA (Blevins et al., 1999). The WA400 produces 

30 the sarA and RNAII transcripts while the RHAUUsar double mutant did not produce any 
transcript other than a limited amount of RNAII (the amount of RNAII is limited based on 
the established role of SarA in agr expression). RNAm is observed only if a plasmid that 
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includes functional versions of the cis elements required for die SarA-media!ed e}q>Fession 
of RNAm was introduced. To ensure that the effects observed are mediated by SarA, the 
results are interpreted by conq)arison to the level of expression observed with ttie same 
pCL84 clone in the corresponding KHAIWsar double mutant. 
5 Finally, to confirm that the results of the northern blots are biologically relevant, the 
expression of a gene that is expressed under the regulatory control of RNADI: hla^ whidi 
encodes the S. aureus a-toxin is assessed. Production of a~toxin is assessed by northern 
blot, spectrophotometric hemolysin assays enq>loying rabbit erythrocytes and by western 
blot enq>loying a polyclonal, a-toxin-specific antiserum (kindly provided by Dr. John 

10 landolo. University of Oklahoma Health Sciences Cent^, Oklahoma City, OK). Northern 
blots are done as previously desc^bed (Blevins ei al., 1999). For quantitative honolysin 
assays, supematants are harvested from cultures at various stages of in vitro growth and 
standardized based on optical density (A^^ of the bacterial culture. Standardized 
supematants are mixed in microtiter plates with an equal volume of 1% washed rabbit 

15 erythrocytes. After a 2 hr incubation at 37^C, imlysed cells are removed by centrifugation 
and the amount of lysis quantitated by measuring the optical density of the supernatant at 
410 nm. If necessary to obtain more quantitative data, dilutions of the standardized 
supernatant were assayed for comparison. For western blot analysis, standardized 
supematants were electrophoresed using 10-20% polyacrylamide gradient gels and tiien 

20 transferred to PVDF monbranes as previously described (Blevins et al.). After transfer, 
blots were developed using the rabbit a-toxin antiserum and the Phototope-HRP western 
blot detection kit (New England Biolabs, Beverly, MA). 

The possible synergistic effects of multiple binding sites obtained by this ^proach are 
examined by using mutagenesis to combine mutations in individual sites into one DNA 

25 fragment. For exanq)le, if mutants in the A1/A2 site reduces RNAm expression by 50% in 
a sar^ strain and the B1/B2 site reduces it by 30%, these mutations can be combined and 
retested. Potential results are the observation of a 50% reduction, a 80% reduction or 
elimination of RNAID expression. 

In wild type strains, RNAUI expression is low in early-log phase and high in late-log 

30 phase. In saf strains, RNAin expression is very low at all timq>oints. If SarA is binding to 
the proposed sites, then mutation of those sites should duninish or eliminate SarA-mediated 
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activation of P3 and a diminution of RNAm by northern blot and of a-toxin by western 
blot will be observed. 

The extent of SarA at the downstream sites identified above, the cooperativity in 
binding to multiple sites, and the loss of affinity in binding to the mutant sites generated in 
5 the section above are studied. These experiments are inq)ortant with regard to the 
development of inhibitors of SarA-mediated activation of agr genes. The avidity of SarA 
binding is important in determining the appropriate concentration and affinity required of 
potential inhibitors. In addition, the diaracterization of the mutant sites, both in vivo and in 
vitro^ are inq>ortant in identifying (he inq)ortant bases that comprise the sites which is also 

10 important for the design of inhibitors. 

The equilibrium binding constants (KqS) for ttie various SarA binding sites usmg 
EMSA, as desCTibed above and in Reditin ei al. (1999) are determined. Briefly, synlfa^c 
oligonucleotides corresponding to the sites are synthesized and radiolabeled with 
Complementary strands are annealed and gel purified. Limiting (< lOpM) DNA are mixed 

15 with serial dilutions of pure SarA and resolved by native gel electrophoresis. The 
radioactive species are detected and quantified in dried gels by phosphorimaging. Each 
experiment is performed at least three times and averaged. The percentage of bound DNA 
is calculated from loss of the unbound DNA in (he gel and plotted versus the concentration 
of SarA present in the sanq>le. The data are fit using BIOEQS and Origm, as described in 

20 Rechtin et al. (1999). The concentration of SarA tiiat results in 50% bound DNA is equal to 
(he Kd und^ these conditions (Hurlburt and Yanofsky, 1990; C^emik et al., 1996). A 
description of (he derivation of the equation used to derive K|> is presented above. The 
relative afBnity of the various SarA binding sites reveal (he most important ones and 
corroborate the mutagenesis and expression studies described above. Furthermore, if 

25 binding sites are within 20 - 30 bp of eadi other, there is reason to suspect tiiat there may 
be cooperative interactions that may enhance the parent activity of SarA. This hypothesis 
is tested by performing EMSA with DNA fragments containing two or more wild type 
sequences, as well as combination of wild type and mutant sequences. Numerous 
transcription factors in prokaryotes and eukaryotes utilize cooperativity to affect gene 

30 regulation. These experiments reveal such interactions for SarA. 

Based on the quantitative EMSA work with oligos described above and in Rechtin et al. 
(1999), very high affinity binding, with K|> values in the range of 5 - 50 pM are anticipated. 

-35- 



wo 01/03686 PCT/US00/18S25 

Subtle changes in the sequences flanking the binding sites may also have effects on the 
affinity of SarA binding. Cooperative interactions between SarA dimers bound to nearby 
sites are easily recognized by the slope of the bmding isotherms. The power logistic fitting 
function normally used in binding data analysis renders a value p that indicates the slope of 
S the curve (Czemik et al., 1996). p values of 1 are non-cooperative, greater than 1 are 
positively cooperative and of less than 1 are negatively cooperative. 
Further analysis of the Structural components of SarA responsible for apr regulation 

Structural information is important for the rational design of anti-st^hylococcal 
inhibitors. Towards this end, two types of structural determinations are pursued. The first 

10 is tfie traditional x-ray crystallogrs^hic ^proacfa. Protein or protein and DNA are brought 
to high concratration, crystallized and die structures are determined from the pattmi of x- 
ray diffiraction. This approadi typically yields high quality structures at atomic resolution 
(2.S - 1.SA). The present invention has already provided a 1.7 ansgstrom data set for SarA 
alone and crystals of SarA-DNA that diffract to 2.5 angstroms. The second approach is 

15 relatively new in biological structure science, atomic force microscopy or AFM. AFM has 
been pioneered by Dr. Carlos Bustamante and has made a tremendous impact on the field of 
transcriptional control in the past few years (reviewed in Bustamante and Rivetti, 1996). In 
AFM, macromolecular species are visualized on mica surfaces in 3-dimensions. Thus, the 
binding site for SarA can be observed direcdy, as well as any conformational changes SarA 

20 inay induce m the DNA. AFM makes it possible to visualize ternary con^^ P3 
and RNA polymerase. 

Data quality crystals have been grown of SarA and a SarA-DNA conqplex. A 
preliminary structure of the protein alone has been d^ermined. 

As described above, crystals of SarA that difiract to 1.7 A resolution have been grown 

25 and an initial chain trace has been carried out. The structure was solved by Multiple 
Isomorphous Replacement (MIR) (Drenth, 1994) using a thimerasol derivative and a 
selenomethionine-substituted protein (LeMaster and Richards, 1985). This initial structure 
was inq>roved by iterative rounds of restrained least squares refinement using the software 
package TNT (Tronrud et al., 1987) and model rebuildmg with O (Jones ei al., 1991). 

30 Solvent molecules were added at die final stages of refinement. Upon conq)letion of the 
refinement, the free R values were calculated using 10% of the structure factors that have 
been excluded from refinonent (Brunger, 1992). This parameter provides a statistical check 
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on the accuracy of the protein model. Additionally, "omit" maps were calculated in which 
10% of tfie structure will be removed systematically and the remaining 90% refined to 
remove any bias. The omitted electron density maps were inspected on a graphics 
workstation and any problem areas rebuilt and re-refined. Finally, the stereochemical 
5 properties of the final protein structure were assessed with PROCHECK (Laskowski et al., 
1993). It should be enq)hasized here that our current SarA crystal form is excellent and the 
ensuing refinement of the SarA structure will provide an accurate atomic resolution view of 
this virulence gene regulator. 

The current SarA-DNA crystal form described in the application diffracts to beyond 2.7 

10 A. HowevCT, the diffraction limit of these crystals is volume-dependent and x-ray sensitive. 
As a first approach to ext»d their resolution, experiments to ''thickra'* the thinnest oystal 
dimension are carried out. Such efforts entail the fine soreoung of multiple crystallization 
param^rs that include pH, ionic streogdi, counter ion identity, e.g., chloride versus 
acetate or sulfate, macromolecule concentration, drop volume, the ratio of protein to 

IS oligodeoxynucleotide (mole:mole) and the initial volume-to-volume ratio of protein to 
crystallization reagent, which dianges the kinetics of the crystallization process. The 
addition of small molecule effectors, including divalent cations or nonionic detergents, are 
also assayed. Other techniques, such as seeding fresh drops of SarA-Octamer solutions with 
small micro or mini crystals of the complex or the sitting drop or batch methods of 

20 crystallization also are used (McPherson, 1990). Additionally, the x-ray sensitivity of the 
SarA-Octamer complex is overcome by collecting the x-ray intensity data at ^170^C, i.e., 
cryocooled (Rog», 1994). Proven cryosolvents include isopropanol, glycerol, low 
molecular weight polyethylene glycols and 2-methyl-2,4-pentanediol. ''Scooping'* 
experiments, followed by flash filing the crystal in liquid nitrogen are used. The quality 

25 of die resulting diffiraction pattern is assessed using the R-Axis IV imaging plate system. 
Ofien the crystal experiencesan unacceptably large increase in its mosaic spread as well as 
ice rings. However, the determination of the suitable cryoconditions for a number of 
crystals has been successful. Whether or not a cryosolvent is found for the SarA-Octamer, 
the current limiting resolution is expected to be extended significantly, perhaps by as much 

30 as 0.4 A, when intensity data is ollected at one of the national synchrotron facilities. 
Resolution beyond the current limit provides not only a more accurate model of the protein- 
DNA conq>lex, but also a better view of the wsiter structure of the conqilex. The role of 
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water in protem-Iigand is becoming more s^reciated in the biology (Fr^ et al.^ 1993; 
Baker, 1995). 

As a parallel approach to the above efforts to improve the resolution of the SarA- 
Octamer complex crystals, alternative DNA sequences are also screened. A sparse matrix 
5 approach is used (Jancarik and Kim» 1991). The crystallization experiments employ the 
hanging drop, vapor diffusion technique and test a variety of crystallization reagents, 
limited to alcohols and organic solvents, different molecular weight polyethylene glycols 
(200 to 20,000), inorganics and mixtures of these conq)ounds. The e;q>eriments are carried 
out at either 4^ or 18T, as tenq)erature is an inqx>rtant experimental variable. The initial 

10 protein concentration of the SarA dimer is O.S mM and 0.5 to 1.0 mM for the DNA site. 
The molar ratio of the protein to DNA is oftei an inspoTtsaxi parameter in cooqilex 
crystallization and in many successful co^stallizations, ttie concentration of DNA is 
higbor and ranges to b^ond twice diat of the protein (Aggarwal, 1990; Sdiumadier a al.^ 
1994). Perhaps die most critical variable, which must be tested in any protein-DNA co- 

15 crystallization experiment is the length of the oligonucleotide as well as the presence or 
absence of a 5* or 3* overhanging nucleoside (Jordan et al,, 1985; Aggarwal et al., 1988; 
Schultz et al., 1990), From die results discussed above, the oligonucleotides up to 45 base 
pairs long from the P2 or P3 promoters that encompass two heptad repeats (CTTAACT) is 
used. 

20 In a typical crystallization screening e7^)eriment a 2 drop of the oystallization 
reagent is added to a 2 fiL drop of the protein-DNA solution. The 4 mixture is then 
inverted and sealed over a 1 mL reservoir from which the oystallization reagent was taken. 
Because the protein-DNA drop has proximately half the ionic strength as the reservoir, 
water diffuses from the drop until equilibrium is reached widi the larger volume reservoir. 

25 This brings the protein-DNA drop to beyond supersaturation and results m either 
macromolecular precipitation or crystallization. The results are monitored by inspection of 
each drop with a stereoscope. The initial screening process requires over 240 set ups per 
temperature. If a condition looks promising, e.g., crystalline precipitates, small needles or 
small three dimensional crystals, finer screens are defined as described above and tested 

30 until large three dimensional crystals are grown reproducibly. Again, as a point of 
emphasis it is very likely that a number of oligonucleotides of different lengths, which are 
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blunt-ended or include S*-nucleoside overhangs, are tried before data-quality crystals of the 
SarA-DNA con^)lex are grown. 

Initial characterization of new SarA-DNA crystals is done by small angle (S^ to 9®) 
precession photography. The resulting diffraction patterns of the principal zones of the 
5 crystal allows the determination of the Laue symmetry, systematic absences, likely space 
group and the unit cell parameters. Three dimensional x-ray intensity data is collected on 
the native crystals with either a two detector Area Detector Systems Corporation multiwire 
area detector or a Rigaku R-Axis IV imaging plate area detector (Molecular Structures 
Corporation). Collected intensities are processed and converted to structure foctors with 

10 software provided by ADSC or MSC. 

The structure of any SarA-DNA complex is d^rmined, for exanq)le, by Molecular 
Replacement (MR) using the refined 1.7 A resolution structure of the SarA dimer as the 
search model. The genetic algorittun-based program EPMR (C. Kissinger, personal 
communication) is used. An additional qyproach to solve Ae SarA-DNA structure is the 

IS method of MIR, whereby isomorphous heavy atom derivatives are generated by 
crystallizing SarA with oligonucleotides that have had a specific thymine replaced by S- 
iodouracil. Such S-iodouracil containing oligonucleotides are easily synthesized and 
purified. The modified oligonucleotide is then used in crystaUization experiments which 
normally yield crystals. Such crystals provide an excellent heavy atom derivative. An added 

20 benefit of S-iodouracil substitution is that every thymine of the DNA binding site can be 
changed, thus providing the potential for forming sevml highly isomorphous heavy atom 
derivatives. 

Difference Patterson functions are calculated for all derivatives and the software 
packages HEAVY (Terwilliger and Eisenberg, 1983) or Phases-95 (Furey and 

25 Swaminathan, 1997) are used to locate and refine all heavy atom sites and finally to 
determine phases for the protein. If necessary density modification, e.g., solvent leveling, 
as in:q)lemented in Phases-9S are carried out to improve the initial MIR electron density 
map. Chain tracing of the initial electron density map are done with the graphics program 
FRODO (Jones, 1985) or O (Jones et al,, 1991). The initial structure determined by either 

30 MIR or MR, is improved and verified as described above for SarA. Additionally, the 
structural and stereochemical properties of the DNA is assessed using the program Curves 
(Lavory and SUenar, 1988). 
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The initial structure d^ermiiiation of SarA has removed or mitigated many potential 
problems typically associated with de novo crystal structure determinations. Specifically, it 
spears that this high-resolution structure will be successful in the molecular replacement 
structure determination of the current SarA-Octamer complex. Altemadvely, the structure 
5 determination method of multiple isomorphous replacement as described in the previously 
herein is used. If there is a problem where none of the 5-iodouracil containing 
oligonucleotides yield a good heavy atom derivative, alternatively the structure is solved by 
using a selenomethionine-substituted protein and multiple wavelength anomalous dispersion 
techniques (Hendrickson et al.^ 1990). This is a known method to determine structures and 

10 is useful fat the SarAOctamer complex. AU that is necessary is the proper data collection 
procedure at a syndirotron fiicility. Intensity data collection at a synchrotron facility 
increases the resolution of the SarA-Octamer conq)lex by perlu^s as nnich as 0.5 A. A 
final potential problon is the growth of dififracdon-<iaality crystals of other SarA-DNA 
conq)lexes, which contain oligonucleotides that describe more fiilly the in vivo binding of 

IS SarA to P2 or P3. These oligonucleotides are on the order of 45 base pairs long, possibly 

resulting in problemts with their cocrystallization with SarA. The Bremian laboratory has 

had a great deal of success in obtaining crystals of a number of protein-DNA complexes. A 

very large number of crystallization e}q)eriments using a wide variety of oligonucleotides is 

useful in the present invention. 

20 Atomic force microscopic imag in g of .SarA-DNA Interactions and SarA«DNA«RNA 
polvmerase interactions 

Atomic force microscopy (AFM) is a powerful technique for observing macromolecular 

complexes directly. Using this technique, the following can be observe (or not observe): 1) 

SarA-SarA interactions if looping is occuring,, 2) SarA-RNA polymerase interactions, 3) 

25 dianges in these interactions that result from using DNA fragments with mutant SarA 

binding sites. 

Individual conoplexes of SarA with DNA fragments from the agr regulatory region, 
with and without RNA polymerase, are obtained with a Nanoscope m microscope (Digital 
Instruments, Santa Barbara, CA) operating in tapping mode. KNA polymerase from wild 
30 type S. aureus strain DB is purified according to the m^ods described in Deora and Misra 
(1996). After the identification of SarA binding sites, the mutant DNA fragments are used 
for AFM. Furthermore, SarA mutant proteins that have altered activities also are used. 
Mutants that egress stable proteins, but are either negative recessive, negative dominant or 
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super-activators could have dramatically different effects when bound to DNA. These 
properties are be visualized using AFM. 

For images in air, silicon nitride tips of type TESP with 125 \im long cantilevers are 
used» whereas assays in buffer require type NP tips. Samples are prq)ared by depositing S 
5 ^iL of a solution of SarA and DNA in a buffer containing 2-10 mM Mg2+, 50-100 mM 
KCl, 10 mM Tris buffer, pH 7.S, which are die same as for the images described above, 
onto freshly cleaved mica. Then the surface is rinsed with water and uranyl acetate rinses is 
tested, since this proved useful in the dqwsition of trp repressor/operator conq>lexes 
(Margeat et cU., 199S) In addition, the Mg2+ ions bridge die negatively diarged DNA to 

10 the negatively charged, freshly cleaved mica sur£ace. The mica diip is then allowed to dry 
in air for 20 minutes before imaging. For imaging in buffer, a drop of buffer is deposited 
onto the surface after deposition of the conq>lexes. Mean-square md-to^end distances are 
determined for a large sample of DNA molecules in absence of protein to ascertain that 
equilibrium deposition has occurred. In presence of protein, the same measurements are 

IS carried out to determine if SarA binding alters the structural properties of the DNA and 
enhances the binding of RNA polymerase. Moreover, a statistical analysis of the specificity 
of binding will be carried out by counting the nimiber of specific vs. non-specific 
conq)lexes for large numbers of molecules. A number of different salt and protein 
concentration conditions are assayed to increase the ratio of specific over non-specific 

20 complexation. Purification of RNA polymerase from aureus is straight forward, 
consisting of precipitation nucleic add and bound proteins widi PEI, ammonhun sulfate 
precipitation, ion exchange chromatogr^hy and gel filtration (Aromatogrs^hy. 
Identification of inhihitnrs of SarA function 

The present invention is based upon die knowledge tiiat pleiotropically-acting 

2S regulatory &ctors are highly attractive targets for novel anti-staphylococcal dierapies. 
Selecting inhbitors that interfere with SarA function which in turn affects the interaction of 
SarA and agr is particularly s^ipealing, because one can inhibit both SarA and agr gene 
products at the same time. The design of effective inhibitors of the SarA-mediated 
activation of agr gene expression requires a detailed understanding of the mechanism of 

30 SarA regulation. 

Any molecule that interferes with the SarA function involved in the expression of 
virulence factors are operable as inhibitors in the present invention. For example, 
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molecules that would inactivate SarA, sudi as changing its structure or chemical 
conformation, resulting in its inability to bind to the agr locus is intended to be 
enconq;>assed by the present disclosure. Particularly useful molecules in die present 
invention are inhibitors that bind to any one of the binding sites of SarA on the agr locus or 
S several of these binding sites. The binding of these inhibitors to SarA*s binding sites 
prevents the binding of SarA, thus preventing SarA from mediating virulence factor 
expression. 

The present invention is directed to the development, synthesis and identification of 
inhibitors of virulence gene expression in S. aureus. One reasonable ^proach to this is to 

10 inhibit SarA-DNA interactions. However, another potmtially potent point of inhibition is 
dimerization of the protein itself. If SarA cannot dimerize, it cannot activate agr gene 
egression. Therefore, it is important to identify the amino acid residues that are important 
for SarA activity, both DNA binding and dnnerization. To accon[q>lish this goal, the 
standard approaches of mutagenesis and subsequent activity assays are applied. Utilizing 

IS the 3-dimensional structure of the protein, specific amino acids for mutagenesis are diosen. 
Mutants are screened for activity in S. aureus. Additionally, the GST-SarA fusion protein 
used by Chien and Cheung (1998) in pGEX^T-1 (Promega) was constructed to provide a 
possible explanation for disparate results as conq)ared to those of the present invention. 
Mutants of SarA and the SarA-DNA conqplex are designed. Two basic classes of 

20 mutants are inq>ortant: 1) those that specifically disrupt DNA binding, and 2) those that 
affect dimerization. Residues K21, K23, K27, K28 (amino acid/position number) are 
targeted first as they line the basic region hole. Initially each of these residues are changed 
to alanine. However, alternative dumges may be appropriate after completion of the 
analysis of the alanine substitutions. Mutants that will likely disrupt the subunit interactions 

25 are focused on FIO, L12 and L13, all hydrophobic residues at the N-terminal end of helix 1 
that is abutted against helix 5. For the mutagenesis, syntiietic oligonucleotides are generated 
that incorporate the desired changes. These will be utilized m the Sculptor mutagenesis 
protocol according to the manufacturer's instructions (Amersham). The mutants are 
confirmed by sequence analysis. 

30 The sarA mutant alleles to be tested in S. aureus are subcloned into pCL84 for 
integration mto the chromosome of CYL316 (see above). Integrants are confirmed by lipase 
assay and by Southern blot with a geh probe. After confirmation, each sarA mutant is 
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introduced into the PC1839 sar mutant (Table II) by Oil-mediated transduction as 
desmbed above. The activity of each mutant is assessed by northern blot using probes 
specific for RNAII and RNAm (Gillaspy et al., 1995) and by analysis of an RNAIII- 
dependent target gene, hia. E3Q>ression of Ata is assessed by northern blot, quantitative 
S hemolysin assay and by western blot using rabbit a-toxin antisera. 



Table II 

Strains and plasmids 









RN6390 


832&4 laboratory strain 


Richard Novick (Balaban and Novick, \ 
1995) 








|ALC136 


8325-4 sarA mutant (ErmT) 


Ambrose Cheung (Bayer et al., 1996) 














|RN6911 


8325-4 agr-null mutant (Tef) 


Richard Novick (Novick et al.. 1993) 








UAMS-1 


Clinical isolate (osteomyelitis) 


Mark Smeltzer (Gillaspy et al.. 1995) 

P^^^jagef[([©llla5^^^^ 


UAMS-6811 


UAMS-1 agrHfnjII mutant (Tef) 


Mark Smeltzer (Gillaspy et al., 1997) 


UAMS-930 


UAMS-1 sa//agr mutant (KanTTet^ 


Mark Smeltzer (unpublished) 








lUAMS-931 


DB sar mutant (Kan*) 


Mark Smeltzer (unpublished) 








IUAMS-933 


DB sar/agr mutant (Kan'^etO 


Mark Smeltzer (unpublished) 




!MWip«®lrf Wfw Mil 




pCL84 


L54a igeh) integration vector 
(Tef when integrated in S. aureus) 


Chia Lee (Lee et aL, 1991) 



With die advantage of designing die mutants based on the structure of SarA, hiqx>rtant 
amino acid residues are identified readily using ttiis qyproach. DififiBrent mutants in specific 
10 regions of SarA may have dramatically different activities and help identify the residues 
critical for function. 

Mutant proteins that result in aberrant regulation of the P3 promoto: arise from changes 
in DNA-binding, activation, or dimerization. To distinguish between these possibilities, the 
mutant proteins m £. coU are ei^ressed, purified and tested for DNA binding and 
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dimerization using assays described above (EMSA, cross-linking, x-ray crystallography and 
AFM). 

The primers designed for cloning the wild type SarA coding region and PCR to amplify 
the mutant alleles are used and cloned into T7 expression vector pET9a. The construct is 
5 checked by sequencing. The purification protocol is developed is used to generate large 
quantities of pure SarA for bioch^cal analysis (Rechtin et al, 1999). This procedure is 
easy for two reasons: 1) both naturally occurring alleles of SarA (DB and RN6390) behave 
identically in die purification procedure* 2) all of the many mutants of other proteins that 
the PI has dealt with previously (mostly E. coU trp rq>ressor) behaved essentially like the 

10 wild type in purification. The concentration of the purified protein is determined 
spectroph(^metrically at 280 mn using an extinction coefficients diat are calculated firom 
the mutant amino add sequence using Ae method of Gill and von Hippel (1989). The 
percentage of active SarA in tfie purified preparation is diecked by stoichiometric DNA 
biiKling using EMSA with the A1/A2 DNA, as described above. This analysis addresses the 

IS fidelity of the protein in the purified sanq>le. 

For DNA-binding analysis, EMSA with synthetic oligonucleotide DNA targets, (e.g., 
A1/A2 from Figure 3; Rechtin et al^ 1999) is used. For these experiments, limiting ^^P- 
labeled DNA is mixed with serially-diluted SarA and is resolved by native gel 
electrophoresis. (Quantitation is accon4)lished by phosphorimaging of the unbound DNA in 

20 dried gels. The percent DNA bound is calculated, binding isotherms are generated and fit 
using BIOE(2S and Origin. Under Aese conditions die equals the concentration of 
protein requked to bmd 50% of the input DNA (Hurlburt and Yanofsl^, 1990). 

For dimerization analysis, chemical crosslinking as described above is used. Various 
concentrations of wild type SarA and mutant SarA is exposed to agents shown to be 

25 effective as described above (BS^ or DSS). The san[q)les are quenched and resolved by SDS- 
PAGE. If the protein is less able to dimerize, relative to the wild type, a higher 
concentration of protein is required to achieve 50% duner in this assay. This assay reflects 
the relative ability of the mutant to dimerize, rather than a Kq for the monomer - dimer 
equilibrium. Mutant proteins are be tested by dynamic light scattering as described above. 

30 Blocking a ci5-acting binding site for SarA with a small molecule inhibits SarA- 
mediated gene activation. In the present invention, the binding sites are those identified 
already herein and in Rechtin et al^ (1999) and those that are to die P3 promoter. 
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Among the small molecules tbat have the desired properties^ peptide nucleic acid (PNA) 
and hairpin polyamides (HP) have received the most attention and show the most promise 
as pharmacological agents (e.g., Gottesfeld ei al., 1997; Corey 1997; White et al., 1998; 
Good and Nielsen» 1998; Dickinson et al., 1998). Based on the analysis of the cis 
5 regulatory sites discussed above, appropriate test PNA and HP molecules for use as anti- 
staphylococcal agents have been designed, synthesized and tested. 

PNA utilizes Watson-Crick base pairing for specificity. The advantages of PNA4ike 
molecules are many. First, they are not found in nature and thus are more stable than DNA 
or RNA. Additionally, they do no have a negatively charged backbone which makes them 

10 bind to DNA or RNA more stably than DNA or RNA would bind. The DNA-PNA 
h^eroduplex is much more stable that the DNA-DNA homoduplex because of the lack of 
negative charge on the PNA backbone and electrostatic repulsion. . As a result PNA can 
displace a strand of duplex DNA as shown in Figure 14. The resultant trimeric conq)lex 
does not resemble a binding site for proteins. In fact, PNA has been successfully used to 

IS inhibit gene expression when designed to interfere with RNA polymerase and/or 
transcription factor binding (Hanvey et al., 1992; Knudsen and Nielsen, 1997; Boffa et al. 
1991 \ Lee et al., 1998). In addition, appropriately designed PNA molecules also inhibit 
gene expression in bacteria and bacterial cell growth (Good and Nielsen, 1998a and 1998b, 
respectively). The main problem with using PNA to inhibit cell growth is low membrane 

20 permeability (Wittung et al., 199S; Good and Nielsen, 1998a and 1998b). Nevertheless, 
. considmble effort is underway in many academic and pharmacratical laboratories to 
overcome this limitation via diimeric PNA molecules Q)ersonal communication). For 
example, PNA-peptide chimeric molecules show promise in this area Q)arsonal 
communication). The peptide is utilized to facilitate endo^tosis and is digested by 

25 intracellular proteases. This has the added feature of concentrating the PNA inside cells. 
Unlike antisense strategies with RNA or DNA, PNA is not degraded by cellular enzymes. 

As discussed above, PNA molecules alone have one documented drawback, they do 
not cross membranes well. Currently, a tremendous amount of effort is being spent in 
academic and iiKlustrial settings to derive ''carriers'* for PNA. For example, one research 

30 group has published increased membrane passage by PNA-peptide conjugates (Hamilton et 
al., 1999) and another group has published a synthesis of PNA-peptide conjugates 
(Goodwin et al. 1998). These conjugates provide a carrier for membrane passage fliat is 
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q>plicable to the SarA function inhibitors of the present invention. Furthermore, a world 
leading PNA researdi group has recently published a PNA conjugate that is readily taken 
up by cells (Ljungstrom et al.^ 1999). Further, another group found that polyarginine (9- 
mer) was very useful as a ''cellular pass" and increases the rate of uptake of many drugs 
S significantly (Service, 2000). Peptides are the simplest conjugates to prq)are because 
PNAs are synthesized by the same chemistry methods used to synthesize peptides. All of 
these efforts are aimed at human diseases and PNA uptake by human cells and applicable in 
treating Staphylococcus infections. Because the Staphylococd are taking up peptides and 
oligonucleotides as their colony is growing, coupling PNA and other inhibitors to peptides 

10 enhances the uptake of PNA by die bacteria. 

HP molecules are polymers of A^-methylimidazole and N-methylpyrrole amino adds 
that bind specifically in the minor groove of diq^lex DNA (Wemm^ and Dervan, 1997). In 
recent years HP& have been developed that can be designed to targ^ any DNA sequence 
(White et al., 1998). Like PNAs, HPs protect the DNA from DNase I digestion and 

IS interrupt normal DNA binding by proteins. Bodi Pol U and Pol m gene expression has 
been shown to inhibited by specific HPs (Gottesfeld et al., 1997; Dickinson et al., 1998). 
HPs currently have an apparent advantage over PNAs, namely that they are readily 
assimilated into cells (Gottesfeld et aL, 1997; Dickinson et al., 1998). Like PNA, HP 
molecules are not degraded enzymatically. 

20 In the present invention, PNA and HP molecules are synthesized diat are designed 

U> interrupt SarA-a^r interactions. Appropriate molecules are selected by in vitro tests first 
and then as inhibitors of exoprotein expression. Molecules that are found to be effective in 
these tests are selected for testing in animal models of staphylococcal disease. 

For PNA synthesis, standard procedures w^e followed (Goodwin et al.y 1998). 

25 Briefly, the F-moc protected monomm for PNA synthesis are purchased from PE 
Biosystems (formerly Perceptive Biosystems). Disposable polypropylene chromatography 
colimms are filled with polyethylene glycoKpolystyrene resin. The scale of the synthesis is 
dependent on the amount of resin utilized. Following resin washing successive cycles of 
monomer coupling, capping, and deprotection, the PNA is released from the resin by 

30 treatment with trifluoroacetic acid. Purification of PNA molecules is by reverse phase 
HPLC and the fidelity of the synthesis is chedced by NfALDI-TOF mass spectrometry. 
Currently, a standard PNA molecule is synthesized in one day, purified in the second day 
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and analyzed by mass spectrome^ on the third day. With this r^id turn-around, a 
reasonably large number of PNAs for inhibition of SarA-agr interactions are screened. 

A simple solid-phase synthesis of HP molecules has been reported (Baird and Dervan, 
1996). The present ^roach is to test the sites of inhibitor binding with PNA molecules 
S first, and those PNAs that are effective acted as a target for subsequent HP synthesis. For 
HP molecules only two monomers are required, each of which requires a 4 -5 step 
synthesis. These steps are straightforward, efficient and generate gram-quantities of highly 
purified material after column chromatogn^hy (Baird and Dervan, 1996). The Boc-Py-OBt 
ester and Boc-Im acid monomers are prepared first. Solid-phase synthesis of the HP 

10 molecules are performed as described by Baird and Dervan (1996). The details of the 
synthesis are similar to standard Boc-peptide syndiesis. The HP molecules are released 
from the solid support and purified by reverse phase HPLC. The fideli^ of the synthesis is 
diecked by MALDI-TOF mass spectrometry. 

The first tests of potential inhibitors are made using EMSA. Briefly, ^-labeled DNA 

15 fragments that correspond to the PNA or HP molecules are used as the targets. For 
example, a PNA designed to bind to the Al site (Figure 3) is tested using the A1/A2 DNA 
fragment. The concentration of SarA required to bind 50% of the DNA fragment is 
determined empirically. To determine the degree of inhibition, a mixture of SarA and ^P- 
labeled DNA that gives 50% free DNA and 50% bound DNA (called 100% in this assay) is 

20 titrated with the PNA molecule. The concentration of PNA that reduces the SarA-DNA 
conq>lex by 50% is the IC50 value. Using dtis sinq)le, but effective testing procedure, the 
PNA or HP design is varied with regard to the sequence targeted and the length of PNA or 
HP molecules. linhibitory PNA and HP molecules are tested against other binding sites for 
SarA, because the most effective inhibitors are those that can interupt SarA-DNA 

25 interactions at multiple sites. 

PNA and HP molecules that are effective in the EMSA assay are tested for inhibition of 
RNAm expression usmg 5. aureus strains RN6390, DB and UAMS-1. RNAIU expression 
is assessed by northern blot as previously described (see above). The expression of RNAIU 
and/or SarA-dependent target genes including a-toxin, lipase, and protein A also is tested. 

30 Briefly, strains are grown in tryptic soy broth to early log phase. Putative inhibitors are 
added at various concentrations and the cultures allowed to grow for various periods. 
Samples taken from mid-exponential and post-exponential phase cultures are processed by 
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centrifugation, with the siqiematant used for analysis of exoproteins (e.g. a-toxin and 
lipase) and the cell pellet used to isolate RNA and for the analysis of surface proteins (e.g. 
protein A). Production of a-toxin is assessed by quantitative hemolysin assay and by 
western blot (see above). Lipase production is detennined using a quantitative 
S spectrophotometric assay employing tributyrin as substrate (Smeltzer et al., 1992). Protein 
A production is be assessed by northern blot using a spa probe and by western blot using 
non-immune rabbit serum. Inhibitors of interest results in reduced production of RNAID, 
reduced production of a-toxin and lipase, and increased production of protein A. To 
confirm the specificity of the inhibitor, parallel cultures of WA400 (RNAm) and 
10 derivatives generated by transducing the WA4(X) RNAm mutation into DB and UAMS-1 
also are induded. 

A prototype PNA inhibitor of SarA-agrbmding is constructed and tested. The 
molecule has the sequence gly-i ri trri AACTA-lys, where the gly is a glycine amino acid 
on the amino terminus and the lys is a lysine amino acid on the C-terminus. These terminal 

IS amino acids are thought to increase the solubility of the PNA and perh2q)s increase the 
ability of the molecule to traverse the cell membrane. The TTTCTTAACTA sequence is 
identical to the 3' end of the A 1 half site, and shoiild hybridize to the lower strand in 
Figure 3. The PNA (PNA-1) was synthesized according to Goodwin et al. (1998) and 
purified by HPLC. The fidelity of the synthesis and confirmation of the structure of PNA-1 

20 was accomplished by mass spectrom^ry. The results show that the molecule binds to an 
oligonucleotide that includes die A1/A2 boxes (die A1/A2 oligo fnm Figure 3.) 20 nM 
A1/A2 was radiolabeled widi ^ and incubated with varying concentrations of PNA-1. 
The conq>lexes w^e resolved by native gel electrophoresis and d^ected by 
phosphorimaging. Figure IS shows the results of this representative expenmtot. The 

2S duplex DNA target and the heteroduplex of PNA-l-DNA are labeled. It is clear fi*om this 
experiment that 1) the affinity of PNA-1 for the A1/A2 oligonucleotide is very high, and 2) 
that nearly all, if not all, of PNA-1 is active in this assay. 

The ability of PNA molecules to conqpete for SarA binding to the Al site is shown 
in Figure 16. Radiolabelled dsDNA (lOnM) correspondmg to the A1/A2 bindmg site is 

30 mixed with either varying concentrations of PNA (right panel) or varymg concentrations of 
PNA (O.TSOpM, 7.S nM, 7S nM and 7S0 nM) and a fixed concentration of pure SarA 
(IQnM) is added to die solution after die A1/A2 bmdmg site (left panel) and die PNA has 
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been incubated together overnight in EMSA buffer. These e}q)eriments are performed 
according to the methods disclosed in Rectin et a/., 1999. The sanq)les were analyzed by 
standard EMSA and phosphorimaging. It is clear that the PNA molecule is able to 
outcompete SarA for binding to the DNA, and forms a protein-DNA complex. However, 
S increasing concentrations of PNA are able to conq>ete with SarA for binding the DNA and 
the SarA-DNA complex is diminished. The PNA molecule used in this experiment is an 
18-mer: gly-TCCAATTTTTCTTAACTA-Iys. 

To determine the tninifTwim length PNA molecule required in the present invention 
to inhibit SarA function, the 11-mer, gly-TTTCTTAACTA-lys (PNA), was incubated with 

10 a mixture of SarA and radiolabeled ds A1/A2 DNA, as disclosed above and in Recin et al., 
1999. The SarA and the A1/A2 DNA formed a complex, and the addition of the U-mer 
did not change this conq>lex. However, v/bea die 18-mer (PNA) described above was 
added to tfie conq>lexed SarA and radiolabeled ds A1/A2 DNA, the complex change from 
the SarA-DNA complex to the PNA-DNA conq>lex, suggesting diat the latter conqdex was 

IS more stable than the former. This result also suggests that the minimum length of a PNA 
molecule must be greater than 11 derived nucleotides to conq)ete out the binding of SarA to 
its binding site, and an 18-mer was of a sufficient length to displace the SarA from the 
DNA. The present invention provides a method to determine the minimum and/or optimum 
length of the SarA function inhibitor that is op^ble in the present invention. 

20 Antisense molecules (oligonucleotide analogs) to die sar gene or to the SarA bindmg 
sites on the agr locus are also inhibitors of the SarA function involved in the e}q)ression of 
virulence factors in staphylococcal infections. Most commonly, these inhibitors are 
relatively small RNA or DNA molecules because they can be designed to be highly 
specific. In general, so-called "antisense" molecules have a sequence which is 

25 complementary to a portion of the mRNA. 

As indicated, the antisense molecules can have a variety of chemical 
constitutions, so long as they retain the ability specifically to bind at the indicated control 
elements. Thus, especially preferred nK)lecules are oligonucleotide analogs -DNA, RNA, 
protein nucleic acids (PNAs) and phosphothiolate oligonucleotides. The oligonucleotides of 

30 the present invention are based, for example, upon ribonucleotide or deoxyribonucleotide 
monomers linked by phosphodiester bonds, or by analogues linked by methyl phosphonate, 
phosphorothioate, or other bonds. These are engineered using standard synthetic techniques 
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to very specifically bind the targeted control region(s). While these molecules may also be 
large, they are preferably relatively small, i.e. , correspondmg to less than about 50 
nucleotides, more preferably less than about 25 nucleotides. Sudi oligonucleotides may be 
prqpared by m^ods well-known in the art, for mstance using conmiorcially available 
5 machines and reagents available from Perkin-Ebner/Applied Biosystems (Foster City, CA). 

Hiosphodiester-Iinked oligonucleotides are particularly susceptible to the action of 
nucleases in serum or inside cells, and therefore in a preferred embodiment the 
oligonucleotides of the present invration are phosphorothioate or methyl phosphonate-linked 
analogues, which have been shown to be nuclease-resistant Stein et al. (1988). Persons 
10 knowledgeable in this field will be able to sdect other linkages for use in the present 
invention. 

To select die pr efe rred lengdi for an antisense oligonudeotide, a balance must be 
struck to gain die most fiivorable diaracteristics. Shorter oligonucleotides 10-15 bases in 
length readily enter cells, but have lower gene specificity. In contrast, longer 

15 oligonucleotides of 20-30 bases offer superior gene specificity, but show decreased kmedcs of 
uptake into cells. See Stein et al. (1988). In a preferred embodiment, this invention 
contemplates using oligonucleotides approximately 14 to 25 nucleotides long. 

PNAs, HPs and antisense molecules are delivered in a variety of ways. They 
are synthesized and delivered as a typical pharmaceutical, usually parenterally. They are 

20 formulated as detailed below, but one preferred formulation involves 

racq>sulation/association with cationic liposomes. They can be modified with a targeting 
sequence or optionally linked to a polyamine, such a polylysine. See Bachmann et al. , 
(1998) for one approach to delivering antisense molecules using a targ^ing sequence. 
Alternatively, antisense molecules are delivered using gene ^erzpy methods, detailed 

25 below. Usinggenethenq)y vectors, single, or multiple tandem copies of antisense 
molecules can be used. 

Administration of PNAs, HPs, and antisense oligonucleotides to a subject are effected 
orally or by subcutaneous, intramuscular, intraperitoneal, or intravenous injection. 
Pharmaceutical compositions of die present invention, however, are advantageously 

30 administered in the form of injectable compositions. A typical composition for sudi purpose 
conqprises a phannaceutically acceptable solvent or diluent and otiier suitable, physiologic 
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coiiqx>unds. For instance, tiie conqx>sition may contain oligonucleotide and about 10 mg of 
human senun albumin per milliliter of a phosphate buffer containing NaQ. 

As much as 700 milligrams of antisense oligodeoxynucleotide has been administered 
intravenously to a patient over a course of 10 days (Le.^ 0.05 mg/kg/hour) wiUiout signs of 
5 toxicity. Sterling (1992), 

All of the patent documents and publications cited herein are hereby incorporated by 
refnence in their entirety. 
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WE CLAIM: 

1. A method of treating a stsq)hylococcaI infection comprising administering to a 
subject having a staphylococcal mfection at least one inhibitor of SarA function involved in 
die egression of staphylococcal virulence genes. 

2. The method of claim 1, wherein said inhibitor interferes with SarA mediated 
activation of the agr locus. 

3. The method of claim 2, wherein said inhibitor interferes with the binding of SarA to 
at least a portion of the agr locus. 

4. The method of claim 3» wherein said portion of the a^r locus is composed of greater 
dian about 70% A-T nucleotides. 

5. The method of claim 4, wherein said portion of die agr locus is conq>osed of 
between about 79-89% A-T nucleotides. 

6. The method of claim S, wherein said inhibitor interferes with the binding of SarA to 
at least a portion of the agr locus dq>icted in Fig. 3. 

7. The method of claim 6, wherein said portion of the agr locus is at least one 
nucleotide sequence selected from the group consisting of at least the nucleotide sequences 
in the Al and A2 boxes, at least the nucleotide sequences in the Bl and B2 boxes and at 
least the nucleotide sequences in the CI and C2 boxes. 

8. The m^od of claim 7, wherein said uihibitor also mterferes with the binding of 
SarA to the intervening nucleotide sequences between the Al and A2 boxes when the 
inhibitor binds to the nucleotide sequence in the Al and A2 boxes, to the int^ening 
nucleotide sequences betweoi the Bl and B2 boxes when die inhibitor binds to the 
nucleotide sequences in the Bl and B2 boxes or to the intervening nucleotide sequences 
between the CI and C2 boxes when the inhibitor binds to the nucleotide sequences in the 
CI and C2 boxes. 

9. The method of claim 3, wh^in said inhibitor binds to at least a portion of the agr 
locus. 
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10. The method of claim 9» wherein said inhibitor is selected from the group consisting 
of an oligonucleotide analog of the Sar A binding site of die agr locus and a hairpin 
polyamide. 

11. The method of claim 10» wherein said oligonucleotide analog is selected from the 
groiq) consisting of a pq>tide nucleic acid molecule, a DNA molecule, a RNA molecule and 
a phosphothiolate oligonucleotide. 

12. The composition of claim 11, wherein said oligonucleotide analog or said hairpin 
polyamide is conjugated to a carrier that facilitates the movement of said oligonucleotide 
analog or said hairpin polyamide mto a st2q>hylococcal cell. 

13. The method of claim 1, wherein said inhibitor is admixed with a pharmaceutically 
acceptable carrier. 

14. A conq)osition comprising an inhibitor of SarA function in the e;q)ression of 
staphylococcal virulence genes and an acceptable carrier. 

15. The conq)osition of claim 14, wherein said acceptable carrier is a pharmaceutically 
acceptable carrier. 

16. The composition of claim 14, wherein said inhibitor interferes with SarA mediated 
activation of the agr locus. 

17. The composition of claim 16, wherein said inhibitor interferes with the binding of 
SarA to at least a portion of the agr locus. 

18. The composition of claim 17, wherein said portion of the agr locus is conq)osed of 
greater than about 70% A-T nucleotides. 

19. The composition of claim 18, wherein said portion of die agr locus is conq)osed of 
between about 79-89% A-T nucleotides. 

20. The conqx>sition of claim 19, wherein said inhibitor interferes with the binding of 
SarA to at least a portion of the agr locus depicted in Fig. 3. 
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21 . The coiiq)Osition of claim 20, wherein said portion of the agr locus is at least one 
nucleotide sequence selected from the group consisting of at least the nucleotide sequences 
in die Al and A2 boxes, at least the nucleotide sequences in the Bl and B2 boxes and at 
least the nucleotide sequences in the CI and C2 boxes. 

22. The conq)osition of claim 21, wherein said inhibitor also interferes with the binding 
of SarA to the intervening nucleotide sequences between die Al and A2 boxes when the 
inhibitor binds to the nucleotide sequence in the Al and A2 boxes, to die intervening 
nucleotide sequences between the Bl and B2 boxes when the inhibitor binds to the 
nucleotide sequences in the Bl and B2 boxes or to the intervening nucleotide sequences 
between the CI and C2 boxes vfbsa the inhibitor binds to the nucleotide sequences in the 
CI and C2 boxes. 

23. The conqx>sition of claim 17, wherein said inhibitor binds to at least a portion of the 
agr locus. 

24. The conq)osition of claim 23, wherein said inhibitor is selected from the group 
consisting of an oligonucleotide analog of the Sar A binding site of the agr locus and a 
hairpin polyamide. 

25. The composition of claim 24, wherein said oligonucleotide analog is selected from 
the group consisting of a pq>tide nucleic acid molecule, a DNA molecule, a RNA molecule 
and a phosphotfaiolate oligonucleotide. 

26. The conqM)sition of claim 25, wherein said oligonucleotide analog or said hairpin 
polyamide is conjugated to a carrier that fiicilitates die movement of said oligonucleotide 
analog or said hairpin polyamide into a staphylococcal cell. 

27. A method of identifying inhibitors of SarA function involved in die expression of 
staphylococcal virulence genes comprising: 

a) contacting a candidate inhibitor with at least one SarA binding site of the agr 
locus in solution to allow the binding reaction to equilibrate for a sufficient period of time; 
and 
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b) assessing the binding of said candidate inhibitor to said SarA binding site of 
the agr locus. 

28. The method of claim 27, further co^^>rising the addition of SarA to the solution of 
step a) simultaneously with said contacting of said uihibitor and said SarA bmding site, and 
assessing the binding afGnity of said candidate inhibitor relative to the binding afOnity of 
said SarA to said SarA binding site of the agr locus. 

29. The m^od of claim 27, further con^)rising the addition of SarA to the solution of 
step a) sequentially after said contacting of said inhibitor and said SarA binding site, and 
assessing the binding affinity of said candidate inhibitor relative to the binding affinity of 
said SarA to said SarA binding site of the agr locus. 

30. The method of claim 27, wherein said assessment of binding of the inhibitor to said 
SarA binding site is performed by an electrophoretic mobility shift assay or fluorescence 
anisotropy. 

31 . The method of claim 27, wherein said SarA binding site of the agr locus as depicted 
in Fig. 3 is at least one nucleotide sequence selected from the group consisting of at least 
the nucleotide sequences in the Al and A2 boxes, at least die nucleotide sequences in the 
Bl and B2 boxes and at least the nucleotide sequences in the CI and C2 boxes. 

32. A method of identifying inhibitors of SarA function involved in the expression of 
sts^hylococcal virulence genes comprising: 

a) contacting a candidate inhibitor with SarA in solution to allow said candidate 
inhibitor to affea the ability of SarA to biiKi to at least one SarA binding site of the agr 
locus; 

b) contacting said solution of step a) with at least one SarA binding site of the 
agr locus either simultaneously with said contacting of said inhibitor and said SarA or 
subsequent to said contacting of said inhibitor and said SarA; and 

c) assessing the inhibition of said candidate inhibitor on the SarA binding to 
said SarA binding site of the agr locus. 
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33. The method of claim 32, wherein said assessment of binding of the inhibitor to said 
SarA binding site is performed by an electrophoretic mobili^ shift assay or fluorescence 
anisotropy. 

34. The method of claim 32, wherein said SarA binding site of the agr locus as dq)icted 
in Fig. 3 is at least one nucleotide sequence selected from the groiqi consisting of at least 
the nucleotide sequences in the Al and A2 boxes, at least the nucleotide sequences in the 
Bl and B2 boxes and at least die nucleotide sequences in the CI and C2 boxes. 
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FIG. 5A 
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FIG. 5B 
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FIG. 6A 
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FIG. 6B 
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FIG. 7B 
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FIG. 16 
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